(57) Abstract


DNA fragments from the promoter region of the C. elegans UL6 gene which are capable of functioning as promoters directing
gene expression in the excretory cell of C. elegans are provided, together with expression vectors and transgenic C. elegans containing these
fragments. Also provided are screening methods performed in C. elegans for identifying compounds or mutations which have an effect on
the morphology of the excretory canal. Compounds identified using these screening methods may have therapeutic potential in the treatment
of a range of diseases for which the C. elegans excretory canal serves as a model.

Expression of DNA or proteins in C. elegans.

The present invention relates to the expression
of DNA, genes, cDNAs, proteins, peptides and parts
thereof in the excretory canal of the nematode worm
C. elegans. In particular, the invention relates to
promoter sequences which are capable of directing
tissue-specific gene expression in the excretory canal
of C. elegans, to expression vectors containing the
promoter sequences, to transgenic C. elegans
specifically expressing reporter genes in the
excretory canal, to methods of identifying chemical
agents that affect the morphology of the excretory
canal and to use of these agents in the
pharmacological treatment of diseases for which the
C. elegans excretory canal serves as a model.

The C. elegans excretory cell

The excretory system of the nematode C. elegans
consists of three cells: a single large excretory
cell, a duct cell and a pore cell that interfaces with
the duct to the main body hypodermis. The excretory
cell is the largest mononucleate cell in C. elegans.
The nucleus and cell body of the excretory cell are
situated at the terminal bulb of the pharynx. The cell
itself is shaped in an H-form, with the two arms
situated along the lateral lines for almost the entire
length of the worm, and slightly dorsal. The excretory
cell is polarized, having an apical domain facing the
lumen of the excretory canal and a basal domain facing
outside. The structure and the organization of the C.
elegans excretory system suggest that it may be used
for osmoregulation and can therefore be considered as
a model for the vertebrate nephron.

Various mutant C. elegans have been reported
which have an aberrant phenotype in the excretory
canal. These aberrant phenotypes include cyst
formation, short canals and branched canals. Various
mutations affecting the excretory canal can be traced
back in C. elegans II, ed. Riddle, Blumenthal, Meyer
and Priess, Cold Spring Harbor Laboratory Press, 1997.

Drug discovery in growth cone steering.

Regulation of cell motility, cell shape and the
outgrowth of axons or other cell outgrowths are all
essential processes in the morphogenesis and function
of both unicellular and multicellular organisms.
Furthermore, the control of these processes is
disturbed in a variety of diseases in which receptors,
extra-cellular signals and intra-cellular pathways are
over- or under-stimulated. The discovery of new genes,
proteins and peptides that are involved in these
processes and chemical entities which modulate them
would very much help the understanding of these
processes. Accordingly, there is a need to develop new
methods for the discovery of novel molecules involved
in the cell motility, cell shape and cell outgrowth
process, and to establish their function. In addition,
since malfunction of these biological processes can
lead to disease there is also a need to discover
chemical entities which modulate these processes which
may be useful as pharmaceuticals. Diseases associated
with cell motility, cell shape and cell outgrowth
include cancerous disease, more particularly tumour
formation, metastasis and vascularisation of tumours.

Drug discovery in renal diseases.

In the drug discovery process it is established
practice to develop a model of a disease which can be
used in the development of assays to screen for
compounds with potential pharmaceutical activity. For
kidney diseases, and more specifically kidney cyst
formation, two different types of disease models
currently exist: models based on cell cultures of
renal epithelial cells and mouse models. Although
these systems have been presented as models for cystic
diseases, such as autosomal dominant polycystic kidney
disease (ADPKD), they have several disadvantages.

The models based on cell cultures can never be
compared with a live multicellular organism. Where
aberrant growth indicative of cyst formation has been
observed in cultures of different cells, it has proven
difficult to develop efficient compound screens from
these models. Furthermore, even if chemicals can be
discovered that modulate cell growth and hence cyst
formation in culture, it remains difficult to prove
that these compounds will have analogous effects in
the renal systems of multicellular organisms.

The developed mouse models for renal cyst
diseases have the disadvantage that they are not
suitable for middle to high throughput screening for
the discovery of pharmacological compounds.
Accordingly, there remains a clear need for an
alternative model of renal diseases which more
accurately models the renal systems of multicellular
organisms but which is practical for use in middle to
high throughput screening.

The present invention relates to the use of the
C. elegans excretory cell in the drug discovery
process. The C. elegans excretory canal is an efficient
tool to study various developmental biological
features; it is formed during the larval stages of the
nematode and the canals are observed to grow along the
animal in early development. Hence, the development of
the excretory canal is an efficient tool to study
growth cone steering and defects that might arise
during its development and the excretory canal can be
used as a model for the development of drug screens in
the area of growth cone steering and directional
outgrowth.

The C. elegans excretory cell and excretory canal
can also be considered as a model of the human kidney
nephron. The excretory canal has apical-basal
polarities analogous to those found in certain kidney
cells, which are relevant for cellular function.
Hence, studying the excretory canal may help to
develop new tools against kidney diseases.
Furthermore, the excretory canal can be used as a
model for the development of drug screens in the area
of kidney diseases.

In order to exploit the potential of the C.
elegans excretory cell and excretory canal both as a
disease model and in the development of drug screens
it would be advantageous to be able to express any
gene or cDNA of interest, including reporter genes,
specifically in the excretory cell and excretory
canal. To achieve this would require the
identification of a tissue-specific promoter which is
active in the excretory cell.

The present inventors have identified, through
the use of biochemical, molecular biology and
transgenic techniques, a promoter fragment that
specifically directs transcription in the C. elegans
excretory cell in a very efficient way. From this
promoter fragment several deletions have been
generated that still promote transcription, and hence
gene expression, in the excretory cell of C. elegans.
These promoter fragments are useful tools as they can
be used to direct specific expression of any DNA
fragment of interest in the excretory cell and
excretory canal.

Accordingly, in a first aspect the invention
provides a DNA fragment which is capable of
functioning as a promoter directing gene expression in
the excretory cell of C. elegans, which DNA fragment
comprises the sequence of nucleotides set forth in
Figure 2 or a fragment thereof in the absence of any
other sequence of consecutive nucleotides from the C.
elegans genome.

The invention further provides a DNA fragment
which is capable of functioning as a promoter
directing gene expression in the excretory cell of C.
elegans, which DNA fragment comprises the sequence of
nucleotides set forth in Figure 3 or a fragment
thereof in the absence of any other sequence of
consecutive nucleotides from the C. elegans genome.

The invention further provides a DNA fragment
which is capable of functioning as a promoter
directing gene expression in the excretory cell of C.
elegans, which DNA fragment comprises the sequence of
nucleotides set forth in Figure 4 or a fragment
thereof in the absence of any other sequence of
consecutive nucleotides from the C. elegans genome.

The invention further provides a DNA fragment
which is capable of functioning as a promoter
directing gene expression in the excretory cell of C.
elegans, which DNA fragment comprises the sequence of
nucleotides set forth in Figure 5 or a fragment
thereof in the absence of any other sequence of
consecutive nucleotides from the C. elegans genome.

The invention further provides a DNA fragment
which is capable of functioning as a promoter
directing gene expression in the excretory cell of C.
elegans, which DNA fragment comprises the sequence of
nucleotides set forth in Figure 6 or a fragment
thereof in the absence of any other sequence of
consecutive nucleotides from the C. elegans genome.

The invention further provides a DNA fragment
which is capable of functioning as a promoter
directing gene expression in the excretory cell of C.
elegans, which DNA fragment comprises the sequence of
nucleotides set forth in Figure 7 or a fragment
thereof in the absence of any other sequence of
consecutive nucleotides from the C. elegans genome.

DNA fragments having the nucleotide sequences set
forth in Figures 2 to 7 exhibit tissue-specific
promoter activity, directing transcription
specifically in the excretory cell and excretory canal
of C. elegans. That is to say, when a reporter gene
under the control of any one of these DNA fragments is
introduced into C. elegans a high level of reporter
gene expression is observed in the excretory cell and
excretory canal with only background expression in
other tissues. As will be described below, these
tissue-specific promoters are useful tools in the
construction of expression vectors which are suitable
for directing gene expression specifically in the C.
elegans excretory cell and excretory canal and in the
construction of transgenic C. elegans in which the
transgene is expressed specifically in the excretory
cell and excretory canal.

In the context of the present application, the
phrase "in the absence of any further sequence of
consecutive nucleotides from the C. elegans genome"
should be interpreted as meaning in the absence of any
other C. elegans genomic sequence consecutive with the
sequences shown in Figures 2 to 7, respectively. In
other words, the DNA fragments of the invention
preferably contain the sequences shown in Figures 2 to
7 in the absence of any other consecutive UL6 promoter
sequences.

In a second aspect the invention provides an
expression vector which is suitable for directing
tissue-specific expression of a heterologous DNA
fragment in the excretory cell of C. elegans, the
vector comprising a promoter comprising a DNA fragment
as set forth in any one of Figures 2 to 7 positioned
to direct expression of the heterologous DNA fragment.

The term "heterologous DNA fragment" refers to
essentially any DNA fragment which it is desired to
express in the excretory cell of C. elegans. This DNA
fragment can be a gene, a cDNA or a fragment thereof
from C. elegans, Drosophila sp., mouse, human,
zebrafish or any other invertebrate or vertebrate
origin. Alternatively, the DNA fragment may be of
prokaryotic origin, a recombinant DNA or a synthetic
DNA fragment. In a preferred embodiment the
heterologous DNA is a reporter gene. Suitable reporter
genes include those encoding green fluorescent protein
(including the many GFP variants and equivalents known
in the art), β-galactosidase, β-lactamase, luciferase,
acetohydroxyacid synthase, alkaline phosphatase,
β-glucuronidase, chloramphenicol acetyltransferase,
horseradish peroxidase, nopaline synthase or octopine
synthase. The above are listed by way of example only
and it is to be understood that the precise nature of
the heterologous DNA fragment is not material to the
invention.

In order to achieve expression in eukaryotic host
cells (e.g. cells of the nematode worm C. elegans) an
expression vector must include promoter sequences to
position RNA polymerase at the transcription start
site and to direct an appropriate frequency of
transcription initiation at this site (e.g. to direct
tissue-specific expression in the C. elegans excretory
cell). In accordance with the invention, the promoter
region of the expression vector may comprise UL6
promoter sequences which fulfill both functions (i.e.
which contain the transcription initiation site for
binding of RNA polymerase and which direct tissue-specific
expression) or the promoter region of the vector may
comprise a minimal promoter region from a
heterologous gene (e.g. the pes-10 promoter) which
functions to position RNA polymerase at the
transcription initiation site and possibly to direct a
basal level of transcription, together with UL6 promoter
sequences to direct a tissue-specific expression
pattern. This will be further understood with
reference to the accompanying Examples. The vector
might further comprise one or more additional
transcriptional regulatory elements (e.g. enhancer
elements) in addition to the UL6 promoter sequences.

The expression vector may also include the
following elements required for eukaryotic gene
expression: a terminator sequence and downstream
polyadenylation signal for transcription termination,
translation initiation sequences for ribosome binding,
a start codon (usually AUG) and a termination codon
for detachment of the ribosome. Such vectors may be
obtained commercially or may be assembled from the
elements described by methods well known in the art.

Examples of expression vectors according to the
invention are plasmids, viral or phage vectors,
plasmid vectors being preferred for use in C. elegans.
Such vectors will normally possess one or more
selectable markers, such as a gene for antibiotic
resistance. Plasmid vectors, including those designed
for expression in C. elegans, may also contain a
bacterial origin of replication to allow replication
in bacterial host cells for cloning purposes. The
construction of plasmid vectors suitable for directing
expression of a reporter gene in the excretory cell of
C. elegans is described in detail in the accompanying
Examples.

Also provided by the invention are host cells and
organisms transformed or transfected with the
expression vector.

In a still further aspect the invention provides
a transgenic C. elegans containing a transgene
comprising a promoter which is capable of directing
tissue-specific gene expression in the excretory cell
of C. elegans operatively linked to a protein-encoding
DNA fragment.

According to the invention the transgene may
comprise any promoter which is capable of directing
tissue-specific gene expression in the excretory cell
of C. elegans operatively linked to any DNA fragment
which it is desired to express in the excretory cell
and excretory canal of C. elegans. Where a promoter
is described herein as being capable of or suitable
for directing tissue-specific gene expression in the
excretory cell and/or excretory canal this should be
taken to mean that the promoter directs a relatively
high level of expression in the excretory cell and/or
excretory canal and only background expression in
other tissues. In a preferred embodiment the promoter
comprises a DNA fragment, as described above,
comprising a sequence of nucleotides as set forth in
any one of Figures 2 to 7 in the absence of any other
consecutive sequence of nucleotides from the C.
elegans genome, i.e. in the absence of any consecutive
UL6 promoter sequences. The promoter may, if
appropriate, contain sequences from elsewhere in the
C. elegans genome which are not consecutive with the
sequences shown in Figures 2 to 7, for example one or
more further cis-acting regulatory elements isolated
from a different type of promoter.

In this context the term "transgene" refers to a
DNA construct comprising a promoter operatively linked
to a protein-encoding DNA fragment. The construct may
contain additional DNA sequences in addition to those
specified above. The transgene may, for example, form
part of a plasmid vector. By the term "operatively
linked" it is to be understood that the promoter is
positioned to drive transcription of the
protein-encoding DNA fragment.

WO 00/54815
PCT/EP00/02373

A transgenic C. elegans according to this aspect
of the invention may be constructed according to any
of the standard techniques known to those skilled in
the art. A suitable approach involves the construction
of a plasmid-based expression vector in which a
protein-encoding DNA of interest is cloned downstream
of a promoter capable of directing tissue-specific
gene expression in the excretory cell of C. elegans.
The plasmid vector is then injected into N2 nematodes.
In order to facilitate the selection of transgenic
nematodes a second plasmid carrying a dominant
selectable marker may be co-injected with the
experimental plasmid.

The plasmid vector is maintained in cells of the
transgenic C. elegans in the form of an
extrachromosomal array. Although plasmid vectors are
relatively stable as extrachromosomal arrays they can
alternatively be stably integrated into the C. elegans
genome using standard technology, for example, using
gamma ray-induced integration of extrachromosomal
arrays (Methods in Cell Biology, Vol 48, pages 425-480).

The protein-encoding DNA fragment can be any DNA
fragment which it is desired to express in the
excretory canal of C. elegans. This DNA fragment can
be a gene, a cDNA or a fragment thereof from
C. elegans, Drosophila sp., mouse, human, zebrafish or
any other invertebrate or vertebrate origin.
Alternatively, the DNA fragment may be of prokaryotic
origin, a recombinant DNA or a synthetic DNA fragment.

In a preferred embodiment the DNA fragment is a
promoterless reporter gene encoding a marker protein
such as, for example, green fluorescent protein (GFP),
β-galactosidase, β-lactamase, luciferase,
acetohydroxyacid synthase, alkaline phosphatase,
β-glucuronidase, chloramphenicol acetyltransferase,
horseradish peroxidase, nopaline synthase or octopine
synthase. The expression of a marker such as GFP makes
it possible to visualize the excretory cell in situ in
the body of the worm without intervening biochemical
steps or specialized equipment such as Nomarski
microscopy. A GFP-expressing excretory cell can be
readily visualized using binocular microscopy
following UV irradiation of the worm.

According to the invention, the transgenic
C. elegans can be of any genetic background, for
example, it can be a wild type worm, a selected mutant
worm or a transgenic worm. A stably integrated
transgene can easily be transferred onto a different
genetic background by performing a genetic cross
between a first parental C. elegans strain containing
a stably integrated transgene and a second parental C.
elegans strain of the desired genetic background.
Standard C. elegans genetics can be employed for this
purpose. The genetic background of the worm generally
has no effect on the expression of the transgene in
the excretory cell and excretory canal.

In a further embodiment of the invention the
transgenic C. elegans further comprises a second
transgene comprising a promoter which is suitable for
directing tissue-specific expression in the excretory
cell of C. elegans operatively linked to a reporter
gene.

This 'double transgenic' C. elegans can be
constructed by co-injecting C. elegans with two
plasmid expression vectors, one containing the
protein-encoding DNA fragment of interest and the
other containing a reporter gene, following the
procedure described above. Both of the plasmid vectors
can be stably integrated into the C. elegans genome
using standard techniques (Methods in Cell Biology,
Vol 48, pages 425-480).

In a further aspect the invention provides a
method of identifying a mutation in a gene involved in
growth cone steering, cell motility, cell shape,
tumour formation, metastasis, vascularisation of
tumours, the development of the excretory canal,
cytoskeletal organisation, surface to cytoskeleton
signalling, renal development or kidney disease, which
method comprises contacting a transgenic C. elegans
which expresses a reporter gene in the excretory canal
with a mutagen and screening for phenotypic changes in
the excretory canal.

Suitable mutagens for use in the method of the
invention include EMS, X-rays or the UV-TMP method,
all of which are known to those skilled in the art.

Following contact with mutagen the transgenic C.
elegans are maintained in culture for at least two
subsequent generations during which time observations
of the morphology of the excretory canals of the
progeny are made in order to identify any mutants with
an abnormal excretory canal phenotype. Suitable
culture conditions are described in the examples given
herein.

The expression of a reporter gene such as GFP in
the excretory canal allows mutations that affect the
development, shape, growth direction and outgrowth of
the excretory canal to be observed and selected. The
affected gene is then isolated and characterized using
standard genetic and molecular biology techniques.

In a preferred embodiment of the method of the
invention the transgenic C. elegans which expresses a
reporter gene in the excretory canal contains a
transgene comprising a promoter which is suitable for
directing tissue-specific gene expression in the
excretory canal of C. elegans operatively linked to a
reporter gene. The promoter preferably comprises a DNA
fragment having the sequence of nucleotides set forth
in any one of Figures 2 to 7.

The method of the invention may also be adapted
for use in identifying further components of a
biochemical pathway involved in growth cone steering,
cell motility, cell shape, tumour formation,
metastasis, vascularisation of tumours, the
development of the excretory canal, cytoskeletal
organisation, surface to cytoskeleton signalling,
renal development or kidney disease. In this case the
mutagenesis method as described above is performed on
a transgenic C. elegans expressing a reporter gene in
the excretory cell whose genetic background is a
selected mutant strain. The selected mutant strain is
a strain carrying a defined mutation in a gene
involved in growth cone steering, cell motility, cell
shape, tumour formation, metastasis, vascularisation
of tumours, the development of the excretory canal,
cytoskeletal organisation, surface to cytoskeleton
signalling, renal development or kidney disease.

The selected mutant strain may be the result of a
previous round of random mutagenesis performed on a
wild-type C. elegans strain or it may be a known
mutant strain, for example a knock-out mutant or an
over-expressing strain taken from a C. elegans
collection (e.g. the C. elegans mutant collection at
the C. elegans Genetic Center, University of
Minnesota, St Paul, Minnesota, USA). Methods for
creating mutant worms with mutations in selected C.
elegans genes are known in the art, for example see J.
Sulston and J. Hodgkin in 'The Nematode Caenorhabditis
elegans' Ed. by William B. Wood and the Community of
C. elegans Researchers, CSHL, 1988, 594-595; Zwaal et
al. 'Target-Selected Gene Inactivation in
Caenorhabditis elegans by using a Frozen Transposon
Insertion Mutant Bank' 1993, Proc. Natl. Acad. Sci.
USA 90 pp 7431-7435; Fire et al. 'Potent and Specific
Genetic Interference by Double-Stranded RNA in
Caenorhabditis elegans' 1998, Nature 391 806-811.

Further treatment of a selected mutant strain
with mutagen results in the production of a double
mutant but it is the phenotype of the later mutation
(i.e. that resulting from contact with mutagen) which
is scored by screening the subsequent generation for
further morphological changes in the excretory canal.
If the phenotype of the selected mutant is enhanced in
the progeny after mutagenesis this indicates that the
second mutation has occurred in a gene which acts on
the same or a parallel biochemical pathway to the gene
affected by the defined mutation. Alternatively, if
the phenotype of the selected mutant is suppressed in
the double-mutant progeny this indicates that the
second mutation event has occurred in an important
gene in the biochemical pathway.
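
The interpretation rule in the preceding paragraph can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the numeric severity score, the tolerance parameter and all names (Modifier, classify_modifier) are hypothetical and are not part of the described method, which scores excretory canal morphology by observation.

```python
from enum import Enum


class Modifier(Enum):
    """Possible outcomes when the double mutant is compared with its parent."""
    ENHANCER = "enhanced: gene acts on the same or a parallel pathway"
    SUPPRESSOR = "suppressed: important gene in the pathway"
    NONE = "no detectable modification"


def classify_modifier(parent_severity: float,
                      double_mutant_severity: float,
                      tolerance: float = 0.0) -> Modifier:
    """Classify a second mutation by its effect on the parental phenotype.

    Severity is a hypothetical score (higher means a more abnormal
    excretory canal); tolerance is the assumed scoring noise.
    """
    difference = double_mutant_severity - parent_severity
    if difference > tolerance:
        return Modifier.ENHANCER
    if difference < -tolerance:
        return Modifier.SUPPRESSOR
    return Modifier.NONE


if __name__ == "__main__":
    # Parent strain scored 2.0; the mutagenised progeny scored 3.5.
    print(classify_modifier(2.0, 3.5).value)
```

In practice the comparison would be made between populations of worms rather than single scores, so a real screen would need an appropriate statistical test in place of the simple tolerance used here.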

Transgenic C. elegans for use in this method, or
for use in any of the subsequently described methods
which require the use of transgenic C. elegans
expressing protein encoding DNAs and/or reporter genes
in the excretory cell and excretory canal, may be
constructed according to standard techniques known in
the art such as the methodology described by Craig
Mello and Andrew Fire, Methods in Cell Biology, Vol 48,
Ed. H. F. Epstein and D. C. Shakes, Academic Press,
pages 452-480.

The present invention further provides a method
of determining whether a compound is an inhibitor or
an enhancer of the regulation of growth cone steering,
cell motility, cell shape, tumour formation,
metastasis, vascularisation of tumours, renal
development, pathways involved in kidney disease,
development of the excretory canal, cytoskeletal
organisation or surface to cytoskeleton signalling,
which method comprises contacting a sample of the
compound with a transgenic C. elegans expressing a
reporter gene in the excretory canal and screening for
phenotypic changes in the excretory canal.

Contact with a compound which is an inhibitor or
an enhancer of the regulation of growth cone steering,
cell motility, cell shape, tumour formation,
metastasis, vascularisation of tumours, renal
development, pathways involved in kidney disease,
development of the excretory canal, cytoskeletal
organisation or surface to cytoskeleton signalling
results in changes in the morphology of the excretory
canal. The expression of a reporter gene in the
excretory canal allows these changes in morphology to
be easily visualized. Commonly observed abnormal
excretory canal morphologies include: nematodes having
shorter or longer canals, nematodes having curved or
extra branched canals, nematodes having ventral or
dorsal canals, nematodes having more or less than two
canals, nematodes having wrongly branched canals,
nematodes having vacuoles or cysts, and nematodes with
unusual features in the excretory canal.
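
As a purely illustrative aid, the morphology categories listed above could be recorded per worm and summarised as follows. The category names mirror the list in the text; the enumeration, the function name abnormal_fraction and the idea of summarising a sample by its abnormal fraction are assumptions made for this sketch and are not taken from the described method.

```python
from collections import Counter
from enum import Enum


class CanalPhenotype(Enum):
    """Excretory canal morphologies, following the categories in the text."""
    NORMAL = "normal"
    SHORTER = "shorter canals"
    LONGER = "longer canals"
    CURVED = "curved canals"
    EXTRA_BRANCHED = "extra branched canals"
    VENTRAL_OR_DORSAL = "ventral or dorsal canals"
    WRONG_NUMBER = "more or less than two canals"
    WRONGLY_BRANCHED = "wrongly branched canals"
    VACUOLES_OR_CYSTS = "vacuoles or cysts"
    OTHER = "other unusual features"


def abnormal_fraction(scores: list[CanalPhenotype]) -> float:
    """Return the fraction of scored worms with any abnormal canal phenotype."""
    if not scores:
        raise ValueError("at least one scored worm is required")
    counts = Counter(scores)
    return 1 - counts[CanalPhenotype.NORMAL] / len(scores)


if __name__ == "__main__":
    # Hypothetical scores for four worms exposed to one test compound.
    treated = [
        CanalPhenotype.NORMAL,
        CanalPhenotype.NORMAL,
        CanalPhenotype.SHORTER,
        CanalPhenotype.VACUOLES_OR_CYSTS,
    ]
    print(abnormal_fraction(treated))
```

Comparing this fraction between compound-treated and untreated worms is one possible way to flag a candidate compound; the text itself does not prescribe any particular scoring scheme.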
Preferred reporter genes include those encoding
green fluorescent protein, β-galactosidase,
β-lactamase, luciferase, acetohydroxyacid synthase,
alkaline phosphatase, β-glucuronidase, chloramphenicol
acetyltransferase, horseradish peroxidase, nopaline
synthase or octopine synthase. In a preferred
embodiment the transgenic C. elegans which expresses a
reporter gene in the excretory canal contains a
transgene comprising a promoter which is suitable for
directing tissue-specific gene expression in the
excretory canal of C. elegans operatively linked to a
reporter gene. The promoter preferably comprises a DNA
fragment having the sequence of nucleotides set forth
in any one of Figures 2 to 7. The genetic background
of the transgenic worm can be wild-type; alternatively
the worm can be a mutated worm or a worm expressing a
second transgene.

For the avoidance of doubt, it is hereby stated
that although the above-described method, and similar
screening methods described hereinbelow, are based on
bringing C. elegans worms into contact with compounds
which may potentially have useful pharmacological
activity there is no intention to provide any
therapeutic benefit to the C. elegans during the
screen. The worms are used merely as a tool to find
out something about the properties of a compound in a
biological system. In principle, this is similar to
carrying out compound screening in vitro using
mammalian cells except that the biological system is a
microscopic multicellular organism rather than a
single cell.

It will be appreciated that a wide variety of
candidate compounds may be tested using the screening
methods described herein. The compound may be of any
chemical formula and may be one of known biological or
pharmacological activity, a known compound without
such activity or a novel molecule such as might be
present in a combinatorial library of compounds.

The invention further provides a compound which
is identifiable using the above method as an inhibitor
or an enhancer of the regulation of growth cone
steering, cell motility, cell shape, tumour formation,
metastasis, vascularisation of tumours, renal
development, pathways involved in kidney diseases,
development of the excretory canal, cytoskeletal
organisation or surface to cytoskeleton signalling.

The invention also provides use of this compound
as a medicament, or in the manufacture of a
medicament, for promoting neuronal regeneration,
re-vascularisation or wound healing or for the treatment
of chronic neuro-degenerative diseases, tumour
formation, metastasis, tumour vascularisation, kidney
diseases, polycystic kidney diseases (specifically
ADPKD), cell migration diseases or immunological
diseases. Also provided by the invention is a
pharmaceutical composition comprising the compound
plus a pharmaceutically acceptable carrier, diluent or
excipient.

The present invention further provides a method
of determining the function of the protein encoded by
a DNA fragment, which method comprises the steps of:

(a) providing a transgenic C. elegans containing
a transgene comprising a promoter which is suitable
for directing tissue-specific gene expression in the
C. elegans excretory cell operatively linked to the
said DNA fragment; and

(b) observing any phenotypic changes in the
excretory canal of the transgenic C. elegans.

According to the method of the invention the
function of a given protein or peptide may be studied
by expressing the protein in the excretory canal of C.
elegans. Analysis of any resultant phenotypic changes
in the excretory canal may result in a better
understanding of the function of the protein.

The transgene comprises a promoter which directs
transcription specifically in the C. elegans excretory
cell and excretory canal. The DNA fragment encoding
the protein of interest, which DNA fragment may be a
genomic DNA, a cDNA or a fragment thereof, is placed
under the control of this promoter and thereby
expressed specifically in the excretory cell and
excretory canal. In a preferred embodiment the
promoter comprises a DNA fragment having the sequence
of nucleotides set forth in any one of Figures 2 to 7.

In order to help visualise any phenotypic changes
in the excretory canal the transgenic C. elegans may
further comprise a second transgene comprising a
promoter suitable for directing tissue-specific gene
expression in the C. elegans excretory cell
operatively linked to a reporter gene, preferably
green fluorescent protein, β-galactosidase,
β-lactamase, luciferase, acetohydroxyacid synthase,
alkaline phosphatase, β-glucuronidase, chloramphenicol
acetyltransferase, horseradish peroxidase, nopaline
synthase or octopine synthase. As described above,
expression of a reporter gene such as GFP makes it
possible to visualise the excretory canal without the
need for specialized equipment.

If expression of a given protein in the excretory 
cell results in an abnormal excretory canal phenotype 
then in a further embodiment the method of the 
invention may be combined with a mutagenesis step in 
order to identify further components of the 
biochemical pathway on which the protein acts. In this 
embodiment a transgenic C. elegans expressing the 
protein in the excretory canal is contacted with a 
mutagen, for example, EMS, UV-TMP or X-rays, all of 
which are well known in the art, and then maintained 
in culture for at least two subsequent generations 
during which time the morphology of the excretory 
canal is observed. If contact with mutagen results in 
either enhancement or suppression of the abnormal 
excretory canal phenotype then this indicates that a 
mutation has occurred in a gene encoding a component 
of the same biochemical pathway as that on which the
original protein acts or of a parallel biochemical
pathway. The mutated gene can then be isolated and
characterised using standard molecular biology and
biochemical techniques.
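The enhancement/suppression logic of this embodiment can be sketched in a few lines of Python. This is an illustration only: the numeric severity score, the `tolerance` threshold and the name `classify_modifier` are assumptions introduced here. The method as described relies on visual inspection of canal morphology rather than on any particular scoring scheme.

```python
def classify_modifier(baseline: float, mutant: float, tolerance: float = 0.1) -> str:
    """Compare a hypothetical canal-abnormality score before and after mutagenesis.

    baseline: score of the transgenic strain expressing the protein (must be > 0,
              i.e. the starting strain already shows an abnormal phenotype).
    mutant:   score of mutagenised progeny carrying the same transgene.
    Returns "enhancer", "suppressor" or "no effect".
    """
    if baseline <= 0:
        raise ValueError("baseline must describe an abnormal (non-zero) phenotype")
    relative_change = (mutant - baseline) / baseline
    if relative_change > tolerance:
        return "enhancer"      # phenotype became more severe
    if relative_change < -tolerance:
        return "suppressor"    # phenotype became milder
    return "no effect"
```

Either an "enhancer" or a "suppressor" call would flag the line for follow-up, since both outcomes point to a gene in the same or a parallel pathway.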

The invention further provides a method of
determining whether a compound is a modulator of
growth cone steering, cell shape, cell motility,
tumour formation, metastasis, vascularisation of
tumours, renal development, a pathway involved in
kidney disease, development of the excretory canal,
cytoskeletal organisation or surface to cytoskeleton
signalling, which method comprises the steps of:

(a) contacting a sample of the compound with a
transgenic C. elegans expressing a DNA fragment
encoding a protein involved in the regulation of
growth cone steering, cell shape, cell motility, renal
development or a pathway involved in kidney disease,
which transgenic C. elegans contains a transgene
comprising a promoter which is suitable for directing
tissue-specific gene expression in the C. elegans
excretory cell operatively linked to the said DNA
fragment; and

(b) screening for phenotypic changes in the
excretory canal.

The transgenic C. elegans may be a wild type
strain or a selected mutant strain. In one embodiment
the transgenic worm has an abnormal excretory canal
phenotype. Alternatively, the DNA fragment expressed
in the excretory cell of the transgenic worm rescues
an abnormal excretory canal phenotype which is present
in the genetic background of the transgenic C.
elegans.

Visible phenotypic changes in excretory canal
morphology may include: nematodes having shorter or
longer canals, nematodes having curved or extra
branched canals, nematodes having ventral or dorsal
canals, nematodes having more or fewer than two canals,
nematodes having wrongly branched canals, nematodes
having vacuoles or cysts, and nematodes with unusual
features in the excretory canal. To assist in
visualising any phenotypic changes the transgenic worm
may further express a reporter gene, preferably
encoding green fluorescent protein, in the excretory
canal.

The present invention further provides a compound
which is identifiable as a modulator of growth cone
steering, cell shape, cell motility, renal
development, a pathway involved in kidney disease,
development of the excretory canal, cytoskeletal
organisation or surface to cytoskeleton signalling
according to the above method. The invention also
provides use of this compound as a medicament, or in
the manufacture of a medicament, for promoting
neuronal regeneration, re-vascularisation or wound
healing, or for the treatment of chronic
neuro-degenerative diseases, metastasis, kidney diseases,
kidney cyst formation, polycystic kidney diseases
(specifically ADPKD), cell migration diseases or
immunological diseases. The compound may be provided
as a pharmaceutical composition comprising the
compound and a pharmaceutically acceptable carrier,
diluent or excipient.

In a still further aspect the invention provides
a method of identifying further components of a
biochemical pathway on which a compound identifiable
as a modulator of growth cone steering, cell shape,
tumour formation, metastasis, vascularisation of
tumours, cell motility, renal development, a pathway
involved in kidney disease, development of the
excretory canal, cytoskeletal organisation or surface
to cytoskeleton signalling may act, which method
comprises the steps of:

(a) providing a transgenic C. elegans which
expresses a reporter gene in the excretory canal;

(b) contacting the transgenic C. elegans with a
mutagen;

(c) contacting the mutated C. elegans with a
compound which is identifiable as a modulator of


WO 00/54815 


PCT/EP00/02373 



growth cone steering, cell shape, tumour formation,
metastasis, vascularisation of tumours, cell motility,
renal development, a pathway involved in kidney
disease, development of the excretory canal,
cytoskeletal organisation or surface to cytoskeleton
signalling; and

(d) screening for phenotypic changes in the
excretory canal.

Following contact with mutagen and the compound
the transgenic C. elegans are maintained in culture
for at least two subsequent generations during which
time observations of the morphology of the excretory
canals of the progeny are made in order to identify
any changes in excretory canal phenotype. As with the
other screening methods hereinbefore described the
expression of a reporter gene, such as GFP, makes it
easy to visualise any phenotypic changes in the
excretory canal.

The compound used in this method is one which is
identifiable as a modulator of growth cone steering,
cell shape, tumour formation, metastasis,
vascularisation of tumours, cell motility, renal
development, a pathway involved in kidney disease,
development of the excretory canal, cytoskeletal
organisation or surface to cytoskeleton signalling
using one of the methods hereinbefore described for
that purpose. As described above, treatment of C.
elegans with such a compound produces morphological
abnormalities in the excretory canal. Suitable
mutagens for use in the method of the invention
include EMS, X-rays or the UV-TMP method, all of which
are known to those skilled in the art.

The use of mutagenesis in the method of the
invention facilitates the identification of further
components of the biochemical pathway on which the
compound acts. If the mutagenesis step results in the
production of mutant progeny in which the abnormal
excretory canal phenotype is enhanced (as compared
with the phenotype observed following treatment of
non-mutated C. elegans with the same compound) then
the mutation has occurred in a gene in the same
biochemical pathway as that on which the compound acts
or in a parallel pathway. Alternatively, treatment of C.
elegans according to the method may result in mutant
progeny in which the abnormal excretory canal
phenotype is suppressed, indicating that a mutation
has occurred in a gene having an important function in
the biochemical pathway on which the compound acts. In
either case the gene affected by the mutation can be
isolated and characterised using standard molecular
biology and biochemical techniques.

In a preferred embodiment of the method of the
invention the transgenic C. elegans which expresses a
reporter gene in the excretory canal contains a
transgene comprising a promoter which is suitable for
directing tissue-specific gene expression in the
excretory canal of C. elegans operatively linked to a
reporter gene. The promoter preferably comprises a DNA
fragment having the sequence of nucleotides set forth
in any one of Figures 2 to 7. The genetic background
of the transgenic C. elegans may be wild type or it
may be a selected mutant strain.


The invention will be further understood with
reference to the following Examples, together with the
accompanying Figures in which:

Figure 1 shows the nucleic acid sequence of the insert
of pUL6#64A1.

Figure 2 shows the nucleic acid sequence of the insert
of pGF2002.

Figure 3 shows the nucleic acid sequence of the insert
of pGF2003.

Figure 4 shows the nucleic acid sequence of the insert
of pGF2006.

Figure 5 shows the nucleic acid sequence of the insert
of pGF2008.

Figure 6 shows the nucleic acid sequence of the insert
of pGF2009.

Figure 7 shows the nucleic acid sequence of the insert
of pGF2012.

Figure 8 shows a restriction map of the UL6 fragment.

Figure 9 is an overview of the series of plasmids
containing deletion fragments of UL6.

Figure 10 is a plasmid map of pGF2006.

Figure 11 is a plasmid map of pGF2009.

Figure 12 is a plasmid map of pGF2013.

Figure 13 is a plasmid map of pGF2014.


General Experimental methods.

Sequence information and expression patterns were
obtained from the C. elegans genome project, the
Sanger Centre, and Washington University School of
Medicine, http://www.sanger.ac.uk/projects/C-elegans.

All molecular biology work was performed using
standard techniques known in the art, as described by
Sambrook et al. (1989), Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory
Press; or F. M. Ausubel et al. (eds.), Current
Protocols in Molecular Biology, John Wiley & Sons,
Inc. (1994), or using minor modifications of the
methods described therein.

All manipulations of C. elegans worms were
performed using techniques described in Methods in
Cell Biology, vol 48; Caenorhabditis elegans: modern
biological analysis of an organism, ed. Epstein and
Shakes, Academic Press, 1995, or using minor
modifications of the methods described therein.

Transgenic C. elegans strains were constructed by
injection of plasmid DNA into N2 worms using standard
techniques known in the art (see Methods in Cell
Biology, vol 48 as mentioned above). In order to
facilitate the selection of transgenic strains the
plasmid pRF4 (Mello, C. C. et al. EMBO J. 10,
3959-3970 (1991)), which carries the rol-6 gene, was
co-injected with the experimental plasmids as a marker.
C. elegans expressing rol-6 exhibit the roller
phenotype. Any other C. elegans dominant selectable
phenotypic marker could be used in place of rol-6 with
equivalent effect. When generating transgenic C.
elegans strains with plasmids that encode and
express GFP or GFP fusion proteins, no co-injection
with a dominant selective marker is needed, as the
transgenic lines can be selected by simple isolation
of the progeny that express GFP.

Example 1

GFP expression in the excretory canal using a minimal
promoter fragment.

pUL#64A1 was isolated as the result of a promoter
trapping experiment described by Young J.M. and Hope
I.A., Molecular markers of differentiation in
Caenorhabditis elegans obtained by promoter trapping
(1993) Dev. Dyn. 196:124-132. In this study partial
Sau3AI restriction enzyme fragments of C. elegans
genomic DNA were cloned in the BamHI restriction site
of the vector pPD22.11 (described by Fire A., Harrison
S.W. and Dixon D., A modular set of lacZ fusion
vectors for studying gene expression in Caenorhabditis
elegans (1990) Gene 93:189-198), creating lacZ
translational fusions. Introduction of pUL#64A1 into
C. elegans resulted in the expression of
β-galactosidase in the excretory cell and excretory
canal and in the lateral nuclei of the hypodermis
adjacent to the anterior and posterior of the
excretory cell. The region of the genomic DNA insert
immediately adjacent to the lacZ gene was sequenced,
enabling the origin of the insert of pUL#64A1 to be
localized on the C. elegans physical genome map.

In order to determine the length of the cloned
insert the vector pUL#64A1 was digested with several
restriction enzymes, including XhoI, SalI, SphI and
HindIII. Restriction fragment length analysis showed
that the cloned fragment in pUL#64A1 was approximately
6 kb in length. Analysis of the genomic DNA of
C. elegans (available from the C. elegans genome
project, the Sanger Centre, and Washington University
School of Medicine,
http://www.sanger.ac.uk/projects/C-elegans) revealed
the presence of two adjacent Sau3AI sites
in the region of the pUL#64A1 insert. This confirmed
that the inserted DNA fragment had a length of 6065 bp
or 6023 bp and could be located between positions
33561 and 39620 on cosmid C17H12. The inserted DNA
fragment was designated UL6.
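The in silico step of locating Sau3AI sites in a known genomic sequence can be sketched as follows. Sau3AI recognises GATC and cuts immediately 5' of the G. The function names and the toy sequence are illustrative assumptions; the real analysis was done on cosmid C17H12, which is not reproduced here.

```python
SAU3AI_SITE = "GATC"  # Sau3AI recognition sequence; the cut falls just before the G

def find_sites(seq: str, site: str = SAU3AI_SITE) -> list[int]:
    """Return 0-based start positions of every occurrence of `site` in `seq`."""
    seq = seq.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions

def complete_digest_lengths(seq: str, site: str = SAU3AI_SITE) -> list[int]:
    """Fragment lengths (top strand) after cutting a linear sequence at every site."""
    bounds = [0] + find_sites(seq, site) + [len(seq)]
    return [end - begin for begin, end in zip(bounds, bounds[1:]) if end > begin]

# Toy example only, not genomic data.
toy = "AAGATCTTTTGATCCC"
sites = find_sites(toy)                  # two GATC sites
fragments = complete_digest_lengths(toy)
```

Because the library was built from a partial digest, an insert boundary may lie at either of two neighbouring sites. That would be consistent with the two candidate lengths reported above, which differ by 42 bp.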

Analysis of the nucleotide sequence of the UL6
insert revealed the presence of two putative genes
orientated in opposite directions. The putative
promoter located upstream of the gene orientated in
the same direction as the lacZ reporter gene was
considered to be the most relevant for further
analysis.

In order to test for promoter activity several
deletion fragments of UL6 spanning this region were
cloned in the GFP-fusion vector pPD95.79 (constructed
by Andrew Fire, Carnegie Institution of Washington, and
freely available at
ftp://stein.cshl.org/pub/elegans_vector/). The promoter
activity of each of the fragments was then tested by
injecting the plasmids into C. elegans and analysing
the levels of GFP expression in different tissues.

A first construct, designated pGF2002, contained
the 3.2 kb HindIII-MscI fragment of UL6. Following
injection of pGF2002 into C. elegans worms, the F1
generation showed GFP expression in the head, tail,
muscles and excretory canal, whilst the F2 generation
(and subsequent generations) expressed GFP in the
excretory canal with background expression in other
cells. The background expression in the other cells is
mosaic: depending on the animal, the
pharynx, the gut, muscle cells or the tail may show
background GFP expression. The resulting strain was
designated UG266 (bgEx34).

A smaller construct containing a 2.2 kb SphI-MscI
fragment of UL6 in pPD95.79 was made and designated
pGF2003. Following injection of pGF2003 into C. elegans
worms, the F1 showed GFP expression in the head, tail,
muscles and excretory canal, analogous to pGF2002. The
expression of GFP in the F2 generation was not
analysed but is expected to be analogous to that
observed with pGF2002, i.e. strong expression in the
excretory canal with background expression in other
tissues.

Finally, a 0.9 kb NruI-MscI fragment of UL6 was
cloned into pPD95.79 digested with MscI and XbaI, the
latter made blunt with Klenow polymerase. After
injection of the resultant plasmid, designated
pGF2001, into C. elegans no GFP expression could be
observed.

These results and the fact that the excretory
cell-specific promoter in the UL6 fragment was thought
to be localized between the two genes, being
approximately between the NruI and the SphI site,
stimulated the inventors to analyse further deletion
constructs. The aim was to determine the smallest
fragment of UL6 that promotes the transcription and
expression in the C. elegans excretory cell and
excretory canal.

The 1.3 kb NruI-SphI insert of pGF2003 was
excised and cloned into the vector pPD97.78. The
latter vector is analogous to pPD95.79, but contains a
minimal pes-10 promoter upstream of the GFP gene. This
means that it is not necessary to make a fusion
construct between the fragment to be tested for
promoter activity and GFP. Vector pPD97.78 was
digested with SphI and HindII to receive the insert,
and the resulting plasmid was designated pGF2006.
Transgenic C. elegans animals harboring this plasmid
from the F2 and subsequent generations show strong GFP
expression in the excretory canal with only minor
background expression in other cell types. The new
C. elegans strain was designated UG267 (bgEx35).

A 1.7 kb MscI-SacI fragment of pGF2003 was cloned
in pPD95.79 and designated pGF2008. After injection of
this plasmid into C. elegans worms the F1 generation
showed GFP expression in the excretory canal, gut,
head, tail and spermatheca.

The region that promotes the expression of GFP in
the excretory canal seemed to be located in the 1.3 kb
NruI-SphI insert of pGF2006. Further deletion analysis
was therefore carried out to identify the minimal UL6
promoter in this fragment. Plasmid pGF2009 was
generated by deleting a 0.5 kb BpiI-HindIII fragment
of pGF2006, leaving a 0.9 kb fragment. The HindIII
site is part of the multiple cloning site of the
vector and adjacent to the SphI site. Plasmid pGF2010
was generated by deleting a 0.9 kb XbaI-BpiI fragment
of pGF2006, leaving a 0.5 kb fragment. The XbaI site
is part of the multiple cloning site of the vector and
adjacent to the NruI-HindII fusion. The construction
of this series of deletions will be further understood
with reference to Figure 8 which shows a restriction
map of the UL6 insert and Figure 9 which gives an
overview of the UL6 deletion constructs.

C. elegans injected with pGF2009 showed GFP
expression in the excretory canal, with only minor
background expression in the other cells, analogous to
the expression results with pGF2006. The resulting C.
elegans strain was designated UG271 (bgEx38). No GFP
could be detected in the F1 generation after
introduction of pGF2010 into C. elegans.

Two further deletions have been constructed to
define the minimal promoter capable of directing
transcription in the excretory canal. pGF2009 was
digested with restriction enzyme BsaI and made blunt
with Klenow polymerase. The resulting fragment was
further digested with XbaI to give a 0.4 kb BsaI-XbaI
fragment and with HindIII to give a 0.5 kb
HindIII-BsaI fragment. These fragments were independently
cloned in pPD97.78, using the HindIII-StuI and
StuI-XbaI sites, respectively. The resulting vectors are
designated pGF2011 and pGF2012. Injection of pGF2012
into C. elegans resulted in expression of GFP in the
excretory canal.
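The deletion series of this Example can be summarised as a small table in code, which makes it easier to see which fragments retained excretory-canal activity. The record layout and the helper `smallest_active` are assumptions made for illustration. Sizes and outcomes are copied from the text above. Where the text does not report a result (pGF2011) or does not assign a size unambiguously (pGF2011, pGF2012), the field is left as `None` rather than guessed.

```python
from typing import NamedTuple, Optional

class Construct(NamedTuple):
    plasmid: str
    fragment: str
    size_kb: Optional[float]    # None where the text does not assign a size unambiguously
    canal_gfp: Optional[bool]   # None where no result is reported

CONSTRUCTS = [
    Construct("pGF2002", "HindIII-MscI", 3.2, True),
    Construct("pGF2003", "SphI-MscI", 2.2, True),
    Construct("pGF2001", "NruI-MscI", 0.9, False),
    Construct("pGF2006", "NruI-SphI", 1.3, True),
    Construct("pGF2008", "MscI-SacI", 1.7, True),
    Construct("pGF2009", "pGF2006 minus BpiI-HindIII", 0.9, True),
    Construct("pGF2010", "pGF2006 minus XbaI-BpiI", 0.5, False),
    Construct("pGF2011", "BsaI subfragment of pGF2009", None, None),
    Construct("pGF2012", "BsaI subfragment of pGF2009", None, True),
]

def smallest_active(constructs):
    """Smallest fragment with a stated size that gave GFP in the excretory canal."""
    active = [c for c in constructs if c.canal_gfp and c.size_kb is not None]
    return min(active, key=lambda c: c.size_kb)

best = smallest_active(CONSTRUCTS)
active_names = [c.plasmid for c in CONSTRUCTS if c.canal_gfp]
```

With the sizes stated in the text, the helper returns pGF2009. pGF2012 is smaller still, being a subfragment of pGF2009, but it is excluded from the size comparison only because its size is left unset here.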

Plasmid expression vectors suitable for
expressing GFP or lacZ or any other reporter protein
in the excretory canal of C. elegans can be integrated
in the genome of the worm using standard technology.
The vectors pGF2006, pGF2009 and pGF2012 are
particularly suitable for this purpose.


Example 2

Methods of screening for new mutations.

To develop a screen for new mutants having a
"short canals" phenotype a C. elegans strain which
contains any of the above plasmids that express GFP in
the excretory canal can be used. To develop a screen
for new mutants with a "ventral canals" phenotype a
worm strain with a stably integrated GFP-expressing
plasmid is crossed with a C. elegans strain having the
UNC phenotype, such as C. elegans strain MT152, unc-53
(n152) (obtained from Dr Bob Horvitz, MIT, Cambridge
MA, USA). F2 worms with short canals (UNC phenotype)
are used for further screening.

General protocol for mutagenesis.

A few thousand adult worms of the desired genetic
background are treated with a hypochlorite solution to
get a synchronised culture. This culture is then
mutagenized using the EMS technique when the worms
have reached the L4 stage (protocol for mutagenesis is
as described in "Methods in Cell Biology, Vol 48, pages
31-35"). As an alternative to EMS the UV-TMP technique
can be used. In this method the worms are contacted
with tri-methyl-psoralen and then treated with UV
radiation, as described in Methods in Cell Biology,
Vol 48. Caenorhabditis elegans: Modern biological
analysis of an organism. Eds H. F. Epstein and D. C.
Shakes, Academic Press. F2 worms are analysed for
aberrant excretory canal phenotypes, which can easily
be visualised due to the expression of GFP.

Observations of excretory canal morphology are
made for two subsequent generations following contact
with mutagen. Dependent on the starting genetic
background of the worms, the following defects can be
observed in the progeny: nematodes having shorter or
longer canals, nematodes having curved or extra
branched canals, nematodes having ventral or dorsal
canals, nematodes having more or fewer than two canals,
nematodes having wrongly branched canals, nematodes
having vacuoles or cysts, and nematodes with unusual
features in the excretory canal.

Example 3

A method of detecting compounds that influence the
excretory canal phenotype.

Worms stably expressing GFP in the excretory
canal can be used to detect and isolate compounds that
have effects on the morphology of the excretory canal
as follows:

Standard agar plates for use with C. elegans are
seeded with E. coli and left to grow to a full lawn. Serial
dilutions of the compound to be tested are then pipetted
onto the E. coli lawn and allowed to diffuse into the
lawn. One transgenic C. elegans expressing GFP in the
excretory canal (L4 stage) per dilution of compound is
put onto the bacterial lawn. The plates are incubated
at 21°C and visually screened for excretory canal
morphology at various time intervals and for two
generations to detect aberrant phenotypes. The
following defects in excretory canal morphology can be
observed: nematodes having shorter or longer canals,
nematodes having curved or extra branched canals,
nematodes having ventral or dorsal canals, nematodes
having more or fewer than two canals, nematodes having
wrongly branched canals, nematodes having vacuoles or
cysts, and nematodes with unusual features in the
excretory canal.

Example 4

Use of the UL6 minimal promoter fragment in expression
of a heterologous DNA in the excretory canal.

The above-described plasmids pGF2002, pGF2003,
pGF2006, pGF2008, pGF2009 and pGF2012 have all been
used to express GFP in the excretory cell and
excretory canal of C. elegans.

The same plasmids can be used to drive the
transcription of DNA fragments other than the GFP
encoding fragment by simply replacing this GFP
encoding fragment with any other DNA fragment of
interest. Preferably the vectors pGF2006, pGF2009
or pGF2012 are used for this purpose. As an
alternative any of the isolated and analysed promoter
fragments described above and shown in Figures 2-7 can
be cloned upstream of the target DNA of interest in an
expression vector suitable for use in C. elegans.

Example 5

Compound screening assay.

The following method may be used to determine
whether a compound is a modulator of growth cone
steering, cell shape, cell motility, tumour formation,
tumour vascularisation, metastasis, renal development,
a pathway involved in kidney disease, development of
the excretory canal, cytoskeletal organisation or
surface to cytoskeleton signalling:

Standard agar plates for use with C. elegans are
seeded with E. coli and left to grow to a full lawn. Serial
dilutions of the compound to be tested are then pipetted
onto the E. coli lawn and allowed to diffuse into the
lawn. One transgenic C. elegans (L4 stage) per
dilution of compound is put onto the bacterial lawn.
The transgenic C. elegans is one expressing both a
protein involved in the regulation of growth cone
steering, cell shape, cell motility, tumour formation,
tumour vascularisation, metastasis, renal development,
a pathway involved in kidney disease, development of
the excretory canal, cytoskeletal organisation or
surface to cytoskeleton signalling and GFP in the
excretory canal. The plates are incubated at 21°C and
visually screened for excretory canal morphology at
various time intervals and for two generations to
detect aberrant phenotypes.
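The plate set-up for such an assay can be planned with a short script. The text does not specify stock concentrations, dilution factors or the number of dilutions, so every number below is a placeholder. The function names are likewise hypothetical. The only detail taken from the protocol is that each dilution receives one L4-stage worm on its own plate.

```python
def serial_dilutions(stock: float, factor: float, steps: int) -> list[float]:
    """Concentrations of a serial dilution series, starting at `stock`.

    Each step divides the previous concentration by `factor`.
    Units are whatever `stock` is expressed in (assumed, not from the source).
    """
    if stock <= 0 or factor <= 1 or steps < 1:
        raise ValueError("need stock > 0, factor > 1 and at least one step")
    return [stock / factor**i for i in range(steps)]

def plate_plan(compound: str, concentrations: list[float]) -> list[dict]:
    """One seeded plate per dilution, each receiving a single L4 worm."""
    return [
        {"plate": i + 1, "compound": compound, "concentration": conc, "worms": 1}
        for i, conc in enumerate(concentrations)
    ]

# Placeholder values: a ten-fold series of four dilutions.
series = serial_dilutions(100.0, 10, 4)
plan = plate_plan("compound X", series)
```

Each plate in the plan would then be scored at intervals over two generations, as described above.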

The following abnormal excretory canal
morphologies can be observed: nematodes having shorter
or longer canals, nematodes having curved or extra
branched canals, nematodes having ventral or dorsal
canals, nematodes having more or fewer than two canals,
nematodes having wrongly branched canals, nematodes
having vacuoles or cysts, and nematodes with unusual
features in the excretory canal.

Example 6

Construction of a C. elegans strain stably expressing
a reporter gene in the excretory canal.

Although expression of reporter genes in the
excretory canal from extrachromosomal arrays is
sufficient to perform most applications, stable
expression in C. elegans by integration would
facilitate and improve most if not all the
applications. Several C. elegans strains have been
constructed, wherein the DNA encoding the promoter
(here designated as UL6) that drives the expression of
the reporter gene, in this case GFP, is integrated
in the genome.

Method

A general method to integrate extrachromosomal
DNA into the genome of C. elegans has been described
by Mello and Fire in Methods in Cell Biology, Volume
48, Caenorhabditis elegans: Modern biological analysis
of an organism, Chapter 19, 466-468. Here the
construction of a particular example is described in
detail:

A wild type N2 C. elegans strain was injected
with pGF2006 (100 ng/µl) and with pUC18 DNA (100 ng/µl)
using standard methods, resulting in strain UG489
bgExHc. Although a relatively high transmission
efficiency (up to 55%) was observed, which complicates
later selections, a clean expression of GFP was
observed in the excretory cell and the excretory
canal.

9 cm NGM plates with a population of late stage L4
worms were gamma-irradiated for 1 hour in order to
obtain a total dose of 30 gray. 6 x 35 animals
were isolated after irradiation, incubated for 24
hours at 20°C and then transferred to fresh plates and
further incubated at 25°C. 2 x 500 F1 animals were
isolated after 48 hours and after 72 hours
respectively, and further incubated at 25°C. 2-4 F2's
per F1 were isolated and incubated at 15°C.
The progeny of these F2's was checked by selecting for
plates with 75% or 100% of the progeny expressing GFP.
Of these positive plates, 4 F3's were isolated and
further incubated at 15°C. Again the progeny was
checked for the presence of 75% to 100% of the worms
expressing GFP.
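The 75% and 100% thresholds follow from Mendelian segregation. A self-fertilising hermaphrodite heterozygous for an integrated transgene is expected to give about three quarters GFP-positive progeny, and a homozygous one gives all GFP-positive progeny. Extrachromosomal arrays transmit at other frequencies. A plate-scoring helper might look like the sketch below. The `tolerance` window around 75% and the function name are assumptions, since the text gives no numerical acceptance criteria.

```python
def classify_plate(gfp_positive: int, total: int, tolerance: float = 0.08) -> str:
    """Classify a plate of self-progeny by its fraction of GFP-expressing worms.

    tolerance: allowed deviation from the expected 75% for a heterozygous
               parent (an assumed value; choose it to suit the sample size).
    """
    if total <= 0 or not 0 <= gfp_positive <= total:
        raise ValueError("counts must satisfy 0 <= gfp_positive <= total and total > 0")
    fraction = gfp_positive / total
    if gfp_positive == total:
        return "candidate homozygous integrant"
    if abs(fraction - 0.75) <= tolerance:
        return "candidate heterozygous integrant"
    return "likely extrachromosomal or no integration"
```

For example, a plate with 22 of 40 GFP-positive worms (55%, the transmission rate quoted for the starting array) would not be retained under these settings.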

The resulting integrated lines were outcrossed
several times to confirm that no major translocations
or unlinked mutations had occurred as a result of the
radiation treatment. This was done by mating wild-type
C. elegans (N2) males with the hermaphrodites carrying
the integration (the N2 strain can be obtained from
CGC, University of Minnesota, USA). F1 males are used
to cross to the mapping strains and to cross back to
N2 hermaphrodites (crossing out once every generation
by always using GFP male progeny, except if the site
of the integration is on X).

The integrations were mapped to the C. elegans
chromosomes, by applying standard techniques well
known in the art. Mapping was performed using at least
the following strains: MSI dpy-5(e61) I; unc-4(e120) II;
lon-1(e185) III and MT464 unc-5(e53) IV; dpy-11(e224) V;
lon-2(e678) X.

Result

Of a total of 1000 F1 worms initially isolated, 650
lines were further examined. Out of these examined
lines, eight integrated lines were retained, crossed
out and mapped:

UG703 bgIs309 has strong hypodermal GFP expression,
integrated on X;

UG704 bgIs310 has low hypodermal GFP expression,
integrated on X;

UG705 bgIs311 has nearly no hypodermal GFP expression,
integrated on X;

UG706 bgIs312 has nearly no hypodermal GFP expression,
integrated on I;

UG707 bgIs313 has strong hypodermal GFP expression,
integrated on IV;

UG708 bgIs314 has nearly no hypodermal GFP expression,
integrated on X;

UG709 bgIs315 has low hypodermal GFP expression,
integrated on X;

UG710 bgIs316 has low hypodermal GFP expression,
integrated on V.

Conclusion

Of the eight independent integrated lines obtained, at
least five are of good quality (bgIs311, bgIs312,
bgIs314, bgIs315 and bgIs316).
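The results of this Example are compact enough to tabulate and summarise programmatically. The sketch below copies the eight lines and their chromosome assignments from the Result section, with allele names written in the standard "Is" form for integrated arrays. The dictionary layout and the derived recovery figure are illustrative additions and are not values stated in the text.

```python
from collections import Counter

# (chromosome, hypodermal GFP background) for each integrated line, as listed above
LINES = {
    "bgIs309": ("X", "strong"),
    "bgIs310": ("X", "low"),
    "bgIs311": ("X", "nearly none"),
    "bgIs312": ("I", "nearly none"),
    "bgIs313": ("IV", "strong"),
    "bgIs314": ("X", "nearly none"),
    "bgIs315": ("X", "low"),
    "bgIs316": ("V", "low"),
}

LINES_EXAMINED = 650  # lines examined further, from the Result section

per_chromosome = Counter(chromosome for chromosome, _ in LINES.values())
recovery_percent = 100 * len(LINES) / LINES_EXAMINED
```

On these numbers, five of the eight integrations map to X, and roughly 1.2% of the examined lines yielded a retained integrant.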

Example 7

Construction of improved vectors to express genes in
the C. elegans excretory cell.

The plasmid expression vectors pGF2006 and
pGF2009 for expression of the reporter gene GFP both
contain a nuclear localisation signal (NLS) in fusion
with the reporter gene. Expression of the fusion
protein (NLS-GFP) results in the translocation of a
part of the total amount of the protein expressed to
the nucleus of the excretory cell. Although the
expression of the fusion protein is high enough to
visualise the fusion protein in the whole excretory
cell including the excretory canal, the present
inventors decided to delete this NLS part. Expression
from the UL6 promoter would hence no longer result in
translocation of the expressed gene product to the
nucleus, and would provide for a more even
localisation of the expressed gene product.

To test this hypothesis the NLS signal was
deleted from the plasmids pGF2006 and pGF2009. This was
easily done by deletion of a KpnI restriction fragment,
resulting in the plasmids pGF2013 and pGF2014
respectively.

SEQUENCE LISTING 

SEQ ID NO: 1 is the 
insert 

15 Figure 


nucleotide sequence of the 
of pUL6#64Al / illustrated in 
1. 


20 


SEQ ID NO: 2 is the nucleotide sequence of the 
insert of pGF2002, illustrated in 
Figure 2. 


SEQ ID NO: 3 is the nucleotide sequence of the 
insert of pGF2003, illustrated in 
Figure 3. 

SEQ ID NO: 4 is the nucleotide sequence of the 
25 insert of pGF2006, illustrated in 

Figure 4 . 


SEQ ID NO: 5 is the nucleotide sequence of the 
insert of pGF2008, illustrated in 
30 Figure 5. 


SEQ ID NO: 


6 


is nucleotide sequence of the insert of 
pGF2009, illustrated in Figure 6. 




SEQ ID NO: 7 is the nucleotide sequence of the 
insert of pGF2012, illustrated in Figure 7. 

SEQ ID NO: 8 is the complete nucleotide sequence of 
plasmid pGF2006. 

SEQ ID NO: 9 is the complete nucleotide sequence of 
plasmid pGF2009. 

SEQ ID NO: 10 is the complete nucleotide sequence of 
plasmid pGF2013. 

SEQ ID NO: 11 is the complete nucleotide sequence of 
plasmid pGF2014. 

The contents of any published patents, patent 
applications and scientific publications referred to 
herein are to be incorporated herein by reference. 
Claims: 

1. A DNA fragment which is capable of functioning as a 
promoter directing gene expression in the excretory cell 
of C. elegans, which DNA fragment comprises the sequence 
of nucleotides set forth in Figure 2 or a fragment 
thereof in the absence of any other sequence of 
consecutive nucleotides from the C. elegans genome. 

2. A DNA fragment which is capable of functioning as a 
promoter directing gene expression in the excretory cell 
of C. elegans, which DNA fragment comprises the sequence 
of nucleotides set forth in Figure 3 or a fragment 
thereof in the absence of any other sequence of 
consecutive nucleotides from the C. elegans genome. 

3. A DNA fragment which is capable of functioning as a 
promoter directing gene expression in the excretory cell 
of C. elegans, which DNA fragment comprises the sequence 
of nucleotides set forth in Figure 4 or a fragment 
thereof in the absence of any other sequence of 
consecutive nucleotides from the C. elegans genome. 

4. A DNA fragment which is capable of functioning as a 
promoter directing gene expression in the excretory cell 
of C. elegans, which DNA fragment comprises the sequence 
of nucleotides set forth in Figure 5 or a fragment 
thereof in the absence of any other sequence of 
consecutive nucleotides from the C. elegans genome. 

5. A DNA fragment which is capable of functioning as a 
promoter directing gene expression in the excretory cell 
of C. elegans, which DNA fragment comprises the sequence 
of nucleotides set forth in Figure 6 or a fragment 
thereof in the absence of any other sequence of 
consecutive nucleotides from the C. elegans genome. 

6. A DNA fragment which is capable of functioning as a 
promoter directing gene expression in the excretory cell 
of C. elegans, which DNA fragment comprises the sequence 
of nucleotides set forth in Figure 7 or a fragment 
thereof in the absence of any other sequence of 
consecutive nucleotides from the C. elegans genome. 

7. An expression vector which is suitable for directing 
tissue-specific expression of a heterologous DNA 
fragment in the excretory cell of C. elegans, said 
vector comprising a promoter comprising a DNA fragment 
as claimed in any one of claims 1 to 6 positioned to 
direct expression of the said heterologous DNA fragment. 

8. An expression vector as claimed in claim 7 wherein 
the heterologous DNA fragment is a reporter gene. 

9. An expression vector as claimed in claim 8 wherein 
the reporter gene encodes green fluorescent protein, 
β-galactosidase, β-lactamase, luciferase, 
acetohydroxyacid synthase, alkaline phosphatase, 
β-glucuronidase, chloramphenicol acetyltransferase, 
horseradish peroxidase, nopaline synthase or octopine 
synthase. 

10. A host cell transformed or transfected with an 
expression vector as claimed in any one of claims 7 
to 9. 

11. A transgenic C. elegans containing a transgene 
comprising a promoter which is capable of directing 
tissue-specific gene expression in the excretory cell 
of C. elegans operatively linked to a protein-encoding 
DNA fragment. 

12. A transgenic C. elegans as claimed in claim 11 
wherein the promoter comprises a DNA fragment as 
claimed in any one of claims 1 to 6. 

13. A transgenic C. elegans as claimed in claim 11 or 
claim 12 wherein the protein-encoding DNA fragment 
comprises a reporter gene encoding green fluorescent 
protein, β-galactosidase, β-lactamase, luciferase, 
acetohydroxyacid synthase, alkaline phosphatase, 
β-glucuronidase, chloramphenicol acetyltransferase, 
horseradish peroxidase, nopaline synthase or octopine 
synthase. 

14. A transgenic C. elegans as claimed in any one of 
claims 11 to 13 wherein said transgene is stably 
integrated into a chromosome of the said C. elegans. 

15. A transgenic C. elegans as claimed in claim 11 or 
claim 12 which further comprises a second transgene, 
which transgene comprises a promoter suitable for 
directing tissue-specific gene expression in the 
excretory cell of C. elegans operatively linked to a 
reporter gene. 

16. A transgenic C. elegans as claimed in claim 15 
wherein the promoter comprises a DNA fragment as 
claimed in any one of claims 1 to 6. 

17. A transgenic C. elegans as claimed in claim 15 or 
claim 16 wherein the reporter gene encodes green 
fluorescent protein, β-galactosidase, β-lactamase, 
luciferase, acetohydroxyacid synthase, alkaline 
phosphatase, β-glucuronidase, chloramphenicol 
acetyltransferase, horseradish peroxidase, nopaline 
synthase or octopine synthase. 

18. A transgenic C. elegans as claimed in any one of 
claims 15 to 17 wherein one or both of the transgenes 
is integrated into the chromosome of the said 
C. elegans. 

19. A method of identifying a mutation in a gene 
involved in growth cone steering, cell motility, cell 
shape, tumour formation, metastasis, vascularisation 
of tumours, renal development, kidney disease, the 
development of the excretory canal, cytoskeletal 
organisation or surface to cytoskeleton signalling, 
which method comprises contacting a transgenic 
C. elegans which expresses a reporter gene in the 
excretory canal with a mutagen and screening for 
phenotypic changes in the excretory canal. 

20. A method as claimed in claim 19 wherein the 
mutagen is EMS, UV-TMP or X-rays. 

21. A method as claimed in claim 19 or claim 20 
wherein the transgenic C. elegans contains a transgene 
comprising a promoter which is suitable for directing 
tissue-specific gene expression in the excretory canal 
of C. elegans operatively linked to a reporter gene. 

22. A method as claimed in claim 21 wherein the 
promoter comprises a DNA fragment as claimed in any 
one of claims 1 to 6. 

23. A method as claimed in any one of claims 19 to 22 
wherein the transgenic C. elegans is a wild-type 
strain or a selected mutant strain. 

24. A method as claimed in any one of claims 19 to 23 
wherein the reporter gene encodes green fluorescent 
protein, β-galactosidase, β-lactamase, luciferase, 
acetohydroxyacid synthase, alkaline phosphatase, 
β-glucuronidase, chloramphenicol acetyltransferase, 
horseradish peroxidase, nopaline synthase or octopine 
synthase. 

25. A method of determining whether a compound is an 
inhibitor or an enhancer of the regulation of growth 
cone steering, cell motility, cell shape, tumour 
formation, metastasis, vascularisation of tumours, 
renal development, pathways involved in kidney 
disease, development of the excretory canal, 
cytoskeletal organisation or surface to cytoskeleton 
signalling, which method comprises contacting a sample 
of the compound with a transgenic C. elegans which 
expresses a reporter gene in the excretory canal and 
screening for phenotypic changes in the excretory 
canal. 

26. A method as claimed in claim 25 wherein said 
transgenic C. elegans is a wild-type strain or a 
selected mutant strain. 

27. A method as claimed in claim 25 or claim 26 
wherein the transgenic C. elegans contains a transgene 
comprising a promoter which is suitable for directing 
tissue-specific gene expression in the excretory canal 
of C. elegans operatively linked to a reporter gene. 

28. A method as claimed in claim 27 wherein the 
promoter comprises a DNA fragment as claimed in any 
one of claims 1 to 6. 

29. A method as claimed in any one of claims 25 to 28 
wherein the reporter gene encodes green fluorescent 
protein, β-galactosidase, β-lactamase, luciferase, 
acetohydroxyacid synthase, alkaline phosphatase, 
β-glucuronidase, chloramphenicol acetyltransferase, 
horseradish peroxidase, nopaline synthase or octopine 
synthase. 

30. A compound which is identifiable as an inhibitor 
or an enhancer of the regulation of growth cone 
steering, cell motility, cell shape, tumour formation, 
metastasis, vascularisation of tumours, renal 
development, pathways involved in kidney disease, 
development of the excretory canal, cytoskeletal 
organisation or surface to cytoskeleton signalling 
using the method of any one of claims 25 to 29. 

31. A compound as claimed in claim 30 for use as a 
medicament for promoting neuronal regeneration, 
re-vascularisation or wound healing. 

32. A compound as claimed in claim 30 for use as a 
medicament for the treatment of metastasis, tumour 
formation, tumour vascularisation, chronic 
neuro-degeneration, kidney disease, kidney cyst 
formation, polycystic kidney diseases, cell migration 
diseases or immunological diseases. 

33. Use of a compound as claimed in claim 30 in the 
manufacture of a medicament for promoting neuronal 
regeneration, re-vascularisation or wound healing. 

34. Use of a compound as claimed in claim 30 in the 
manufacture of a medicament for the treatment of 
metastasis, kidney disease, kidney cyst formation, 
polycystic kidney diseases, cell migration diseases or 
immunological diseases. 

35. A pharmaceutical composition comprising a compound 
as claimed in any one of claims 30 to 34 and a 
pharmaceutically acceptable carrier, diluent or 
excipient therefor. 

36. A method of determining the function of the 
protein encoded by a DNA fragment, which method 
comprises the steps of: 

(a) providing a transgenic C. elegans containing a 
transgene comprising a promoter suitable for directing 
tissue-specific gene expression in the C. elegans 
excretory cell operatively linked to the said DNA 
fragment; and 

(b) observing any phenotypic changes in the excretory 
canal of the transgenic C. elegans. 

37. A method as claimed in claim 35 wherein the 
promoter comprises a DNA fragment as claimed in any 
one of claims 1 to 6. 

38. A method as claimed in claim 36 or claim 37 
wherein the transgenic C. elegans further comprises a 
second transgene comprising a promoter suitable for 
directing tissue-specific gene expression in the 
excretory cell of C. elegans operatively linked to a 
reporter gene. 

39. A method as claimed in claim 38 wherein the 
promoter of the second transgene comprises a DNA 
fragment as claimed in any one of claims 1 to 6. 

40. A method as claimed in claim 38 or claim 39 
wherein the reporter gene encodes green fluorescent 
protein, β-galactosidase, β-lactamase, luciferase, 
acetohydroxyacid synthase, alkaline phosphatase, 
β-glucuronidase, chloramphenicol acetyltransferase, 
horseradish peroxidase, nopaline synthase or octopine 
synthase. 

41. A method of determining whether a compound is a 
modulator of growth cone steering, cell shape, tumour 
formation, metastasis, vascularisation of tumours, 
cell motility, renal development, a pathway involved 
in kidney disease, development of the excretory canal, 
cytoskeletal organisation or surface to cytoskeleton 
signalling, which method comprises the steps of: 

(a) contacting a sample of the compound with a 
transgenic C. elegans expressing a DNA fragment 
encoding a protein involved in the regulation of 
growth cone steering, cell shape, cell motility, renal 
development or a pathway involved in kidney disease, 
which transgenic C. elegans contains a transgene 
comprising a promoter which is suitable for directing 
tissue-specific gene expression in the C. elegans 
excretory cell operatively linked to the said DNA 
fragment; and 

(b) screening for phenotypic changes in the excretory 
canal. 

42. A method as claimed in claim 41, wherein said 
transgenic C. elegans is a wild-type strain or a 
selected mutant strain. 

43. A method as claimed in claim 41 or claim 42 
wherein the promoter comprises a DNA fragment as 
claimed in any one of claims 1 to 6. 

44. A method as claimed in any one of claims 41 to 43 
wherein the transgenic C. elegans further contains a 
second transgene comprising a promoter suitable for 
directing tissue-specific gene expression in the 
excretory cell of C. elegans operatively linked to a 
reporter gene. 

45. A method as claimed in claim 44 wherein the 
promoter of the second transgene comprises a DNA 
fragment as claimed in any one of claims 1 to 6. 

46. A method as claimed in claim 44 or claim 45 
wherein the reporter gene encodes green fluorescent 
protein, β-galactosidase, β-lactamase, luciferase, 
acetohydroxyacid synthase, alkaline phosphatase, 
β-glucuronidase, chloramphenicol acetyltransferase, 
horseradish peroxidase, nopaline synthase or octopine 
synthase. 

47. A method as claimed in any one of claims 41 to 46 
wherein the transgenic C. elegans has an abnormal 
excretory canal phenotype. 

48. A method as claimed in any one of claims 41 to 47 
wherein the DNA fragment expressed in the excretory 
cell of the transgenic C. elegans rescues an abnormal 
excretory canal phenotype which is present in the 
genetic background of the transgenic C. elegans. 

49. A compound which is identifiable as a modulator of 
growth cone steering, cell shape, cell motility, 
tumour formation, metastasis, vascularisation of 
tumours, renal development, a pathway involved in 
kidney disease, development of the excretory canal, 
cytoskeletal organisation or surface to cytoskeleton 
signalling using the method of any one of claims 41 
to 49. 

50. A compound as claimed in claim 49 for use as a 
medicament for promoting neuronal regeneration, 
re-vascularisation or wound healing. 

51. A compound as claimed in claim 49 for use as a 
medicament for the treatment of chronic 
neuro-degenerative diseases, metastasis, tumour 
formation, tumour vascularisation, kidney diseases, 
kidney cyst formation, polycystic kidney diseases, 
cell migration diseases or immunological diseases. 

52. Use of a compound as claimed in claim 49 in the 
manufacture of a medicament for promoting neuronal 
regeneration, re-vascularisation or wound healing. 

53. Use of a compound as claimed in claim 49 in the 
manufacture of a medicament for the treatment of 
chronic neuro-degenerative diseases, metastasis, 
tumour formation, tumour vascularisation, kidney 
diseases, kidney cyst formation, polycystic kidney 
diseases, cell migration diseases or immunological 
diseases. 

54. A pharmaceutical composition comprising a compound 
as claimed in any one of claims 49 to 53 and a 
pharmaceutically acceptable carrier, diluent or 
excipient therefor. 

55. A method of identifying further components of a 
biochemical pathway on which a compound identifiable 
as a modulator of growth cone steering, cell shape, 
cell motility, renal development, a pathway involved 
in kidney disease, development of the excretory canal, 
cytoskeletal organisation or surface to cytoskeleton 
signalling may act, which method comprises the steps 
of: 

(a) providing a transgenic C. elegans which expresses 
a reporter gene in the excretory canal; 

(b) contacting the transgenic C. elegans with a 
mutagen; 

(c) contacting the mutated C. elegans with a compound 
which is identifiable as a modulator of growth cone 
steering, cell shape, cell motility, renal 
development, a pathway involved in kidney disease, 
development of the excretory canal, cytoskeletal 
organisation or surface to cytoskeleton signalling; 
and 

(d) screening for phenotypic changes in the excretory 
canal. 

56. A method as claimed in claim 55 wherein the 
mutagen is EMS, UV-TMP or X-rays. 

57. A method as claimed in claim 55 or claim 56 
wherein the transgenic C. elegans contains a transgene 
comprising a promoter which is suitable for directing 
tissue-specific gene expression in the excretory canal 
of C. elegans operatively linked to a reporter gene. 

58. A method as claimed in claim 57 wherein the 
promoter comprises a DNA fragment as claimed in any 
one of claims 1 to 6. 

59. A method as claimed in any one of claims 55 to 58 
wherein the reporter gene encodes green fluorescent 
protein, β-galactosidase, β-lactamase or luciferase. 
gatcgctcgaataaaaaattttataatgtcagattacgttttagatccaaaaaaa 
attaggctgaatccgtttttgattctctagaattttgtttagccaaaaaacatcg 
cattgttctctctgtcgttattaaagtgcgcaattaacaaatattcaaaaattct 
tatttaaaaaaccatagggggttccctaactattgcgaaattctctttctctctc 
ggacattactatgagagattgaaccaattgaagagacgcagtgctaggaagggaa 
tgcgccctggcgccacatatggtctaagtctctctgaatcgtgtgctctctacat 
tgggtcgactctgcgagtgcttttgagcggcgccgagaccgaaataagagagaaa 
acagagggaaaaaagatagggaacattaattgatgaaaaaagaaacagatgacgt 
ggcaattcttctttccggagtttttttttattggaagtggggttattcaagtaat 
gtagcaaaatgtatttaaatacatttgtgacgtcacaaatgtatttaaatacatg 
tttttatatacttgaataaggttgtgacgtaatttttctacactttttaattttc 
cgacactacttgaataaccccaaaagtgtacggtttcttttttcaaaacacgatt 
gcaaccaaaggcgccggatttttgaatttttcaaaaatcggaagatttaaaattt 
tcgcttttttatttttattacttgataaaaattgaattttatggtgaaatttcaa 
aatttgttttggcttttcagttgtccggaaaatcgaaaatgtttgtttttcgatt 
cccactgaaaaaatcgaattttttcgatgaaacctittgattgcaatgttagaaaa 
atttctaaaaaattaacaagaacactggaacaacacaaaatcacaataatcaaca 
ccggaccattggtaccgtataaaaagaaaaaaaaatcgaattaattgtaatagtt 
ctgaaaatctttcataaatacccattttgaatataattatttcttaataaatata 
taacttacagagaaaaaaatttcggggacgttaaggctcatagaagaacacacga 
aaattaggtaagttaaggctgaaggcatttggcctactctacacgtagcagggaa 
atgatgtaaaatggaaaagagagaaaatatataaaacatatgaatgtgcaaggat 
tcggtatataaattatatatgaagggtccttattaaggaatttgttttcctgtat 
gtacaatctagaaaagagcagaagggttattgaaagggtggagtagagtcaaaat 
tgtttttttactttaaattacagaaaatggacactaatgacggaatataactata 
aacattttttctaaattttgaaaaatgattttttcaattttgcacctctcaaatt 
aagagcactaaagaaaattacagaaaaaccagtaaaatttgagaatttgaaacgg 
atatctccaaaaagtagtcgaagtctaagaatcaaaaaagaaattaaaatttttt 
ttagttacggtatattttctgtcatttaaagcagttgactccactccacctttaa 
aacaatataaattatctaatgaggcctagcagaatatcggttcctcaaatacgga 
ttatagggtgtcatatcgccttttctccattcgctccaaacgattttggcgacaa 
ccagatgaaaagcgacgacgacaacaaaaacgaacacccagaatgttgtcattcc 
tccagttgattctccagatcgatgcatttcaataacttcttcttcgtgtgccggt 
gctgcaccaacttccgacacgtgaatggcatttcttaggagttcaagatgtcgtc 
gagatggagcaatgtagagaagagatgttactggtttcgtgacttttactggtaa 
ttgaactggagtctttccaagtgaaactgggctatcgaggcatccaccatcacaa 
acaaataattgactagaagatgaaacctgaaagaaatagtttagagtatttccag 
gataaatatttttaaaatttaaataaaagacgtgtttcagctcttttgacaagga 
tttttattcattttttgagaagtctcactatgaaattcggggccagttcagtcta 
tttaagcaaca-agaaaccacaaactatgctacagttgtaaaacattctataaact 
attaccataaaataaggacgattttccgtatccaattttatatccactcgaatag 
taccaacaacacccatattcgaataaactggaattccatgaaaattcatttcacg 
atgcaaatcatgagcarccaatgctacttccaaatgagaatcgtgataactcgat 
gcagtcaacgattgaacaaatgcttgtgatgctccaattgctccagtgctcaaaa 
gattgttgcaacctcgrttttggaaagtgagcatccatgtattcacgagctcaaa 
aagttttgtcggagtatcccagtcttgccacagacgtgatgggaagagaagattt 
gaatgaccagtgtagcagagctcgcttttccgcgccaaattttccagaggctata 
aacattctattttaattaggtaataatattcaattacctttacagattccaaagt 
agccccatgctcaattgttggagcccgtaaattgctcattaaaatgtattttgtt 
gcattaattctatcrccgtttaatgtatttggtgctttcgaattcgcaacactga 
atcctgttaaatgcaucttgctccatccgtcaaaatgctccgaatccaaagtttt 
cgagtcttttcctgattggaaaaatcggatagcatcccgagaagcttcttcactt 
gttggcttcgacgcaatcgagcatgtaaaccgaaatgactcttctctgaaataaa 
ctgaacaaattaattaatttttttaatgttttaaatataccttttctgtgaaact 
gaaacttttccaggtgcatcagtacaaacaacagcatacgatgctctagcaactt 
gccgctgatgaaccgtcagctcgcctaattttgaatgagcccaccagttggaact 
aagacgagacaagttgagtcggaccgttgatttctgaaattaaatgttctttttg 
taattttaatgcatcaatttcaataaaattacagttggattggaaacttgtatca 
attgaacaaaatagtttggacgagttcgatgagcccatatctgtgacgtcacaca 
tgcacattcgccatcctgaaacattaatgtccaataaatagtttattatgtcgtg 
gctaaccattgaaaaacatcgcagaattttgaaaatgccatttttgaaatccgtc 
atttttgtaacctcaacaggttgttgtccgtcaatttcaatgttgagcccgggat 
aaaaattggtttccacatcgagaactcggtttttctcctgaactcgtaattgtcc 
gaatgaatcaacaccaatatatccatttcccacaaactctacgctttgattgtta 
attcctgtatcgaactcctaaaaaaaagtaaatatgaataattactaattattcg 
actcacatcaatctctttttcccataattttaacttttccgctgaacattgatgc 
tgttttgtgaatgatgtgccggaagacgttcctcgcgagaagatgaaaatgacga 
ataaaacaactccaataattatcgctaatcttcgttttggatatcgtcgaaccat 
acgcattttactttcgaaatccatattttctattcgacgcacgaaagtgcgccac 
gttattgcggcgctcattttggaggggaaatccgcctgaaaaatcaattgtttgt 
attgtgaaatttcgaagaggcataaaacaagaaaacggacatgaaagcgcgttgc 
atgcaaggttagttgcctgtttaagcattatccccgcatgtagcttgttcggcac 
cgttaaaaatgctgagtaatcagctttttagaatttaaaatattaaacttttaaa 
attgcaacaaacatcgacaaatattcaagaggcgaatgatatcgggaatttcgat 
tgaaacgaaactgttttgaaattcaaaaagtattttcaaagtattgtccgcaagg 
cacatcacgcaaacttgcagaatctaccgtatcccatacatttttatagtttttc 
cctcagtttttaaaattaaaaacgctgaaaaagcgattaaatttatttaaatgca 
tcgttcgaataaaataaagtttatcttttgataaaaacatgagtttcctttggag 
aaaagtagggatttcgcctttcaaaaaattatttcgtgcaggatgctattttcgt 
ggcgaaacccatacccaagagctcatgcgtcttcttgattactgtagatgtttgg 
caacttatttttacataaaaacgttttcattcattatttccatcattcatttatc 
tttctgtgtttttagttagttttagctagtttttttctaaattcctaactttaaa 
aaatctggaaaagaaaattaaaaaattttgtccctattatttattttattactgg 
aaaatcttcaaacaggaaaacccaccgcgttttgcttattgctgtatttatgaaa 
aaaaaaacaatcaatattggtcaagtaaataagaaaaaattaacgaatctctatc 
tgacaccagatgcgaccctctattccacttctctgttcatctgctgcttcttttg 
tttaaccagataaatctccctcggggaaaaccgtcaaaaaaaggcaaactaaatg 
caaacacgctctatagacaaaatgtgtttggtctcgtcacgaatggtgagagaga 
attggcctccgccgcagagatcgcttgattattggcctccagtgggcaatgtcgg 
ggaaaaccaaactattgatgagaggtatcgacgaaaaatcaacaatgaccaactt 
tttgttacagttttgttataaatatgagttttggatattccattgcgtatttttc 
ttttctactttcaaaaaatctgctccaacctttaatggcttttcctgtcttgtca 
aaatctggatttttgaatatataatttttaaaaccatcaaattcagcgaaatgaa 
atcatgtaatacaattttttattttttccgactgttgtgtattccatcaaactat 
tcaaaaaatcaatacaatgatttttttttcatttttcgcgattttttattatttt 
gtcgtctgaaaacctttttactaataaaataatttacagggaaaaccactaacga 
ctgtagccatgggaatcagcgacaacgacgttcagaagcagctccgccacatgat 
ggctttcattgagcaagaggccaatgagaaggctgaggagatcgatgctaaagcc 
gaggaagaattcaacattgagaaagtaaggaattaaaacatttactcctttaaaa 
ctatactaaaatctcttctaaaaaacggaaaaccttgaaattatgaattcattca 
aattgtttcagggacgtcttgttcaacaacaacgtcaaaagattatggaattctt 
cgagaagaaggagaaacaagtcgagcttcaacgcaaaattcaagcctccaactct 
ctcaacgctggacgtcttcgttgcttgaaggtgagagaaaacgtttctcaacatt 
ttcaaaaacattaatcgccttaaaattgaaaaccagttctgaatcggacacattt 
gaattaaaaacatattttcaggctcgtgaagaccacatcggagccgtactcgacg 
aggctcgctcgaatctctcccgtatttccggagatgctgctcgttatccagctat 
tttgaagggacttgtcatgcaaggacttcttcaattgctcgaaaaggaagtcgtc 
cttcgttgccgtgagaaggatcttcgtcttgttgagcaacttttgccagagtgcc 
ttgacggacttcaaaaggagtggggaagcaccaccaaggtcgttctcgataaaca 
aaacttcttgccatcggagtctgctggaggagttgaactttctgctcgtgctgga 
aagatccccgggattggcca 
aagcttcttcacttgttggcttcgacgcaatcgagcatgtaaaccgaaatgactc 

ttctctgaaataaactgaacaaattaattaatttttttaatgttttaaatatacc 

ttttctgtgaaactgaaacttttccaggtgcatcagtacaaacaacagcatacga 

tgctctagcaacttgccgctgatgaaccgtcagctcgcctaattttgaatgagcc 

caccagttggaactaagacgagacaagttgagtcggaccgttgatttctgaaatt 

aaatgttctttttgtaattttaatgcatcaatttcaataaaattacagttggatt 

ggaaacttgtatcaattgaacaaaatagtttggacgagttcgatgagcccatatc 

tgtgacgtcacacatgcacattcgccatcctgaaacattaatgtccaataaatag 

tttattatgtcgtggctaaccattgaaaaacatcgcagaattttgaaaatgccat 

ttttgaaatccgtcatttttgtaacctcaacaggttgttgtccgtcaatttcaat 

gttgagcccgggataaaaattggtttccacatcgagaactcggtttttctcctga 

actcgtaattgtccgaatgaatcaacaccaatatatccatttcccacaaactcta 

cgctttgattgttaattcctgtatcgaactcctaaaaaaaagtaaatatgaataa 

ttactaattattcgactcacatcaatctctttttcccataattttaacttttccg 

ctgaacattgatgctgttttgtgaatgatgtgccggaagacgttcctcgcgagaa 

gatgaaaatgacgaataaaacaactccaataattatcgctaatcttcgttttgga 

tatcgtcgaaccatacgcattttactttcgaaatccatattttctattcgacgca 

cgaaagtgcgccacgttattgcggcgctcattttggaggggaaatccgcctgaaa 

aatcaattgtttgtattgtgaaatttcgaagaggcataaaacaagaaaacggaca 

tgaaagcgcgttgcatgcaaggttagttgcctgtttaagcattatccccgcatgt 

agcttgttcggcaccgttaaaaatgctgagtaatcagctttttagaatttaaaat 

attaaacttttaaaattgcaacaaacatcgacaaatattcaagaggcgaatgata 

tcgggaatttcgattgaaacgaaactgttttgaaattcaaaaagtattttcaaag 

tattgtccgcaaggcacatcacgcaaacttgcagaatctaccgtatcccatacat 

ttttatagtttttccctcagtttttaaaattaaaaacgctgaaaaagcgattaaa 

tttatttaaatgcatcgttcgaataaaataaagtttatcttttgataaaaacatg 

agtttcctttggagaaaagtagggatttcgcctttcaaaaaattatttcgtgcag 

gatgctattttcgtggcgaaacccatactcaagagctcatgcgtcttcttgatta 

ctgtagatgtttggcaacttatttttacataaaaacgttttcattcattatttcc 

atcattcatttatctttctgtgtttttagttagttttagctagtttttttctaaa 

ttcctaactttaaaaaatctggaaaagaaaattaaaaaattttgtccctattatt 

tattttattactggaaaatcttcaaacaggaaaacccaccgcgttttgcttattg 

ctgtatttatgaaaaaaaaaacaatcaatattggtcaagtaaataagaaaaaatt 

aacgaatctctatctgacaccagatgcgaccctctattccacttctctgttcatc 

tgctgcttcttttgtttaaccagataaatctccctcggggaaaaccgtcaaaaaa 

aggcaaactaaatgcaaacacgctctatagacaaaatgtgtttggtctcgtcacg 

aatggtgagagagaattggcctccgccgcagagatcgcttgattattggcctcca 

gtgggcaatgtcggggaaaaccaaactattgatgagaggtatcgacgaaaaatca 

acaatgaccaactttttgttacagttttgttataaatatgagttttggatattcc 

attgcgtatttttcttttctactttcaaaaaatctgctccaacctttaatggctt 

ttcctgtcttgtcaaaatctggatttttgaatatataatttttaaaaccatcaaa 

ttcagcgaaatgaaatcatgtaatacaattttttattttttccgactgttgtgta 

ttccatcaaactattcaaaaaatcaatataatgatttttttttcatttttcgcga 

ttttttattatttcgtcgtctgaaaacctttttactaataaaataatttacaggg 

aaaaccactaacgaccgtagccatgggaatcagcgacaacgacgttcagaagcag 

ctccgccacatgacggctttcattgagcaagaggccaatgagaaggctgaggaga 

tcgatgctaaagccgaggaagaattcaacattgagaaagtaaggaattaaaacat 

ttactcctttaaaactatactaaaatctcttctaaaaaacggaaaaccttgaaat 

tatgaattcattcaaattgtttcagggacgtcttgttcaacaacaacgtcaaaag 

attatggaattcttcgagaagaaggagaaacaagtcgagcttcaacgcaaaattc 

aagcctccaactctctcaacgctggacgtcttcgttgcttgaaggtgagagaaaa 
cgtttctcaacattttcaaaaacattaatcgccttaaaattgaaaaccagttctg 
aatcggacacatttgaattaaaaacatattttcaggctcgtgaagaccacatcgg 
agccgtactcgacgaggctcgctcgaatctctcccgtatttccggagatgctgct 
cgttatccagctattttgaagggacttgtcatgcaaggacttcttcaattgctcg 
aaaaggaagtcgtccttcgttgccgtgagaaggatcttcgtcttgttgagcaact 
tttgccagagtgccttgacggacttcaaaaggagtggggaagcaccaccaaggtc 
gttctcgataaacaaaacttcttgccatcggagtctgctggaggagttgaacttt 
ctgctcgtgctggaaagatccccgggattggcca 

gcatgcaaggttagttgcctgtttaagcattatccccgcatgtagcttgttcggc 
accgttaaaaatgctgagtaatcagctttttagaatttaaaatattaaactttta 
aaattgcaacaaacatcgacaaatattcaagaggcgaatgatatcgggaatttcg 
attgaaacgaaactgttttgaaattcaaaaagtattttcaaagtattgtccgcaa 
ggcacatcacgcaaacttgcagaatctaccgtatcccatacatttttatagtttt 
tccctcagtttttaaaattaaaaacgctgaaaaagcgattaaatttatttaaatg 
catcgttcgaataaaataaagtttatcttttgataaaaacatgagtttcctttgg 
agaaaagtagggatttcgcctttcaaaaaattatttcgtgcaggatgctattttc 
gtggcgaaacccatactcaagagctcatgcgtcttcttgattactgtagatgttt 
ggcaacttatttttacataaaaacgttttcattcattatttccatcattcattta 
tctttctgtgtttttagttagttttagctagtttttttctaaattcctaacttta 
aaaaatctggaaaagaaaattaaaaaattttgtccctattatttattttattact 
ggaaaatcttcaaacaggaaaacccaccgcgttttgcttattgctgtatttatga 
aaaaaaaaacaatcaatattggtcaagtaaataagaaaaaattaacgaatctcta 
tctgacaccagatgcgaccctctattccacttctctgttcatctgctgcttcttt 
tgtttaaccagataaatctccctcggggaaaaccgtcaaaaaaaggcaaactaaa 
tgcaaacacgctctatagacaaaatgtgtttggtctcgtcacgaatggtgagaga 
gaattggcctccgccgcagagatcgcttgattattggcctccagtgggcaatgtc 
ggggaaaaccaaactattgatgagaggtatcgacgaaaaatcaacaatgaccaac 
tttttgttacagttttgttataaatatgagttttggatattccattgcgtatttt 
tcttttctactttcaaaaaatctgctccaacctttaatggcttttcctgtcttgt 
caaaatctggatttttgaatatataatttttaaaaccatcaaattcagcgaaatg 
aaatcatgtaatacaattttttattttttccgactgttgtgtattccatcaaact 
attcaaaaaatcaatataatgatttttttttcatttttcgcgattttttattatt 
ttgtcgtctgaaaacctttttactaataaaataatttacagggaaaaccactaac 
gactgtagccatgggaatcagcgacaacgacgttcagaagcagctccgccacatg 
atggctttcattgagcaagaggccaatgagaaggctgaggagatcgatgctaaag 
ccgaggaagaattcaacattgagaaagtaaggaattaaaacatttactcctttaa 
aactatactaaaatctcttctaaaaaacggaaaaccttgaaattatgaattcatt 
caaattgtttcagggacgtcttgttcaacaacaacgtcaaaagattatggaattc 
ttcgagaagaaggagaaacaagtcgagcttcaacgcaaaattcaagcctccaact 
ctctcaacgctggacgtcttcgttgcttgaaggtgagagaaaacgtttctcaaca 
ttttcaaaaacattaatcgccttaaaattgaaaaccagttctgaatcggacacat 
ttgaattaaaaacatattttcaggctcgtgaagaccacatcggagccgtactcga 
cgaggctcgctccaatctctcccgtatttccggagatgctgctcgttatccagct 
attttgaagggacttgtcatgcaaggacttcttcaattgctcgaaaaggaagtcg 
tccttcgttgccgtgagaaggatcttcgtcttgttgagcaacttttgccagagtg 
ccttgacggacttcaaaaggagtggggaagcaccaccaaggtcgttctcgataaa 
caaaacttcttgccatcggagtctgctggaggagttgaactttctgctcgtgctg 
gaaagatccccgggattggcca 
gcatgcaaggttagttgcctgtttaagcattatccccgcatgtagcttgttcggc 
accgttaaaaatgctgagtaatcagctttttagaatttaaaatattaaactttta 
aaattgcaacaaacatcgacaaatattcaagaggcgaatgatatcgggaatttcg 
attgaaacgaaactgttttgaaattcaaaaagtattttcaaagtattgtccgcaa 
ggcacatcacgcaaacttgcagaatctaccgtatcccatacatttttatagtttt 
tccctcagtttttaaaattaaaaacgctgaaaaagcgattaaatttatttaaatg 
catcgttcgaataaaataaagtttatcttttgataaaaacatgagtttcctttgg 
agaaaagtagggatttcgcctttcaaaaaattatttcgtgcaggatgctattttc 
gtggcgaaacccatactcaagagctcatgcgtcttcttgattactgtagatgttt 
ggcaacttatttttacataaaaacgttttcattcattatttccatcattcattta 
tctttctgtgtttttagttagttttagctagtttttttctaaattcctaacttta 
aaaaatctggaaaagaaaattaaaaaattttgtccctattatttattttattact 
ggaaaatcttcaaacaggaaaacccaccgcgttttgcttattgctgtatttatga 
aaaaaaaaacaatcaatattggtcaagtaaataagaaaaaattaacgaatctcta 
tctgacaccagatgcgaccctctattccacttctctgttcatctgctgcttcttt 
tgtttaaccagataaatctccctcggggaaaaccgtcaaaaaaaggcaaactaaa 
tgcaaacacgctctatagacaaaatgtgtttggtctcgtcacgaatggtgagaga 
gaattggcctccgccgcagagatcgcttgattattggcctccagtgggcaatgtc 
ggggaaaaccaaactattgatgagaggtatcgacgaaaaatcaacaatgaccaac 
tttttgttacagttttgttataaatatgagttttggatattccattgcgtatttt 
tcttttctactttcaaaaaatctgctccaacctttaatggcttttcctgtcttgt 
caaaatctggatttttgaatatataatttttaaaaccatcaaattcagcgaaatg 
aaatcatgtaatacaattttttattttttccgactgttgtgtattccatcaaact 
attcaaaaaatcaatataatgatttttttttcatttttcgcga 
gagctcatgcgtcttcttgattactgtagatgtttggcaacttatttttacataa 

aaacgttttcattcattatttccatcattcatttatctttctgtgtttttagtta 

gttttagctagtttttttctaaattcctaactttaaaaaatctggaaaagaaaat 

taaaaaattttgtccctattatttattttattactggaaaatcttcaaacaggaa 

aacccaccgcgttttgcttattgctgtatttatgaaaaaaaaaacaatcaatatt 

ggtcaagtaaataagaaaaaattaacgaatctctatctgacaccagatgcgaccc 

tctattccacttctctgttcatctgctgcttcttttgtttaaccagataaatctc 

cctcggggaaaaccgtcaaaaaaaggcaaactaaatgcaaacacgctctatagac 

aaaatgtgtttggtctcgtcacgaatggtgagagagaattggcctccgccgcaga 

gatcgcttgattattggcctccagtgggcaatgtcggggaaaaccaaactattga 

tgagaggtatcgacgaaaaatcaacaatgaccaactttttgttacagttttgtta 

taaatatgagttttggatattccattgcgtatttttcttttctactttcaaaaaa 

tctgctccaacctttaatggcttttcctgtcttgtcaaaatctggatttttgaat 

atataatttttaaaaccatcaaattcagcgaaatgaaatcatgtaatacaatttt 

ttattttttccgactgttgtgtattccatcaaactattcaaaaaatcaatataat 

gatttttttttcatttttcgcgattttttattattttgtcgtctgaaaacctttt 

tactaataaaataatttacagggaaaaccactaacgactgtagccatgggaatca 

gcgacaacgacgtccagaagcagctccgccacatgatggctttcattgagcaaga 

ggccaatgagaaggctgaggagatcgatgctaaagccgaggaagaattcaacatt 

gagaaagtaaggaattaaaacatttactcctttaaaactatactaaaatctcttc 

taaaaaacggaaaaccttgaaattatgaattcattcaaattgtttcagggacgtc 

ttgttcaacaacaacgtcaaaagattatggaattcttcgagaagaaggagaaaca 

agtcgagcttcaacgcaaaattcaagcctccaactctctcaacgctggacgtctt 

cgttgcttgaaggtgagagaaaacgtttctcaacattttcaaaaacattaatcgc 

cttaaaattgaaaaccagttctgaatcggacacatttgaattaaaaacatatttt 

caggctcgtgaagaccacatcggagccgtactcgacgaggctcgctcgaatctct 

cccgtatttccggagatgctgctcgttatccagctattttgaagggacttgtcat 

gcaaggacttcttcaattgctcgaaaaggaagtcgtccttcgttgccgtgagaag 

gatcttcgtcttgttgagcaacttttgccagagtgccttgacggacttcaaaagg 

agtggggaagcaccaccaaggtcgttctcgataaacaaaacttcttgccatcgga 

gtctgctggaggagttgaactttctgctcgtgctggaaagatccccgggattggc 

ca 
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ttttcattcattatttccatcattcatttatctttctgtgtttttagttagtttt 
agctagtttttttctaaattcctaactttaaaaaatctggaaaagaaaattaaaa 
aattttgtccctattatttattttattactggaaaatcttcaaacaggaaaaccc 
accgcgttttgcttattgctgtatttatgaaaaaaaaaacaatcaatattggtca 
agtaaataagaaaaaattaacgaatctctatctgacaccagatgcgaccctctat 
tccacttctctgttcatctgctgcttcttttgtttaaccagataaatctccctcg 
gggaaaaccgtcaaaaaaaggcaaactaaatgcaaacacgctctatagacaaaat 
gtgtttggtctcgtcacgaatggtgagagagaattggcctccgccgcagagatcg 
cttgattattggcctccagtgggcaatgtcggggaaaaccaaactattgatgaga 
ggtatcgacgaaaaatcaacaatgaccaactttttgttacagttttgttataaat 
atgagttttggatattccattgcgtatttttcttttctactttcaaaaaatctgc 
tccaacctttaatggcttttcctgtcttgtcaaaatctggatttttgaatatata 
atttttaaaaccatcaaattcagcgaaatgaaatcatgtaatacaattttttatt 
ttttccgactgttgtgtattccatcaaactattcaaaaaatcaatataatgattt 
ttttttcatttttcgcgat 


ggtctcgtcacgaatggtgagagagaattggcctccgccgcagagatcgcttgat 
tattggcctccagtgggcaatgtcggggaaaaccaaactattgatgagaggtatc 
gacgaaaaatcaacaatgaccaactttttgttacagttttgttataaatatgagt 
tttggatattccattgcgtatttttcttttctactttcaaaaaatctgctccaac
ctttaatggcttttcctgtcttgtcaaaatctggatttttgaatatataattttt 
aaaaccatcaaattcagcgaaatgaaatcatgtaatacaattttttattttttcc 
gactgttgtgtattccatcaaactattcaaaaaatcaatataatgattttttttt 
catttttcgcga 


WO 00/54815    PCT/EP00/02373

9/12

[Figure: plasmid map; legible restriction-site labels: Bsa I (360), Hind III, Sac I (2553), Sac I (2472)]
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SEQUENCE LISTING 


<110> DEVGEN NV

<120> Expression of DNA or proteins in C. elegans

<130> SC3/51336/002

<140>
<141>

<160> 11

<170> PatentIn Ver. 2.0


<210> 1 

<211> 6070 

<212> DNA 

<213> Caenorhabditis elegans 


<400> 1 

gatcgctcga 

gctgaatccg 

tctgtcgtta 

agggggttcc 

gaaccaattg 

aagtctctct 

cgccgagacc 

aaaaaagaaa 

ggttattcaa 

taaatacatg 

ttttccgaca 

gcaaccaaag 

tttttatttt 

ggcttttcag 

gaattttttc 

aacactggaa 

gaaaaaaaaa 

atataattac 

gctcatagaa 

acacgtaqca 

gtgcaaggat 

tgcatgtaca 

tgttttttta 

tttctctaaa 

aagaaaat ta 

agtcgaagtc 

gtcatttaaa 

ctagcagaat 

tcgctccaaa 

acccagaatg 

tcttcgtgtg 

agatgtcgtc 

ggtaattgaa 

acaaataatt 

atatttttaa 

ttttttgaga 

accacaaact 

atttcccgta 

aaccggaatt 


ataaaaaati 
t-rttgattc 
ttaaagtgcg 
ccaactattg 
aagagacgca 
gaatcgtgtg 
gaaataagag 
cagatgacgt 
gtaatgtagc 
tttttatata 
ctacttgaat 
gcgccggatt 
tattacttga 
ttgtccggaa 
gatgaaacct 
caacacaaaa 
tcgaartaau 
rtcttaataa 
gaacacacga 
gggaaatgat 
tcggtatata 
auctagaaaa 
cz ctaaatta 
tcttgaaaaa 
cagaaaaacc 
taagaatcaa 
gcagttgact 
atcggttcct 
cgattttggc 
tcgtcattcc 
ccggtgctgc 
gagatggagc 
ccggagtctt 
gactagaaga 
aatttaaata 
agtctcacta 
atgctacagt 
tccaat tcta 
ccatgaaaac 


ttataatgtc 
tctagaattt 
caattaacaa 
cgaaattctc 
gtgctaggaa 
ctctccacat 
agaaaacaga 
ggcaattctt 
aaaatgtatt 
cttgaataag 
aaccccaaaa 
tttgaatttt 
taaaaattga 
aatcgaaaat 
ttgattgcaa 
tcacaataat 
tgtaatagtt 
atatataacc 
aaattaggta 
gtaaaatgga 
aatcatataC 
gagcagaagg 
cagaaaatgg 
tgattttttc 
agtaaaattt 
aaaagaaatt 
ccactccacc 
caaatacgga 
gacaaccaga 
tccagttgat 
accaacttcc 
aatgtagaga 
tccaagtgaa 
tgaaacctga 
aaagacgtgt 
tgaaactcgg 
tgtaaaaca t 
tatccactcg 
tcatttcaco 


agattacgtt 
tgtttagcca 
atattcaaaa 
tttctctctc 
gggaatgcgc 
tgggtcgact 
gggaaaaaag 
ctttccggag 
taaatacatt 
gttgtgacgt 
gtgtacggtt 
tcaaaaatcg 
attttatggt 
gtttgttttt 
tgztagaaaa 
caacaccgga 
ctgaaaatct 
tacagagaaa 
agttaaggct 
aaagagagaa 
gaagggccct 
gt tatcgaaa 
acactaacga 
aattttgcac 
gagaatttga 
aaaatttttt 
tttaaaacaa 
ttatagggtg 
tgaaaagcga 
tcsccagatc 
gacacgtgaa 
agagatgtta 
actgggctat 
aagaaacagt 
ttcagctcct 
ggccagtcca 
tctataaact 
aatagtacca 
atacaaatca 


ttagatccaa 
aaaaacatcg 
attcttattt 
ggacattact 
cctggcgcca 
ccgcgagtgc 
atagggaaca 
ttttttttta 
tgtgacgtca 
aatttttcta 
tcttttttca 
gaagatttaa 
gaaatttcaa 
cgattcccac 
atttctaaaa 
ccattggtac 
ttcataaata 
aaaatttcgg 
gaaggcattt 
aatatataaa 
tattaaggaa 
gggcggagta 
cggaatataa 
ctctcaaatt 
aacggatatc 
ttagttacgg 
cataaattat 
tcatatcgcc 
cgacgacaac 
gatgcatttc 
tggcatttct 
ctggtttcgt 
cgaggcaccc 
ctagagtatt 
t tgacaagga 
gtcCatttaa 
at taccataa 
acaacaccca 
tgagcaccca 


aaaaaattag 
cattgttctc 
aaaaaaccat 
atgagagatt 
catatggtct 
ttttgagcgg 
ttaattgatg 
ttggaagtgg 
caaatgtatt 
cactttttaa 
aaacacgatt 
aattttcgct 
aatttgtttt 
tgaaaaaatc 
aattaacaag 
cgtataaaaa 
cccattttga 
ggacgttaag 
ggcctactct 
acatatgaat 
tttgttttcc 
gagtcaaaat 
ctataaacat 
aagagcacta 
tccaaaaagt 
tatattttct 
ctaatgaggc 
ttttctccat 
aaaaacgaac 
aataacttct 
taggagttca 
gacttttact 
accatcacaa 
tccaggataa 
tttttattca 
gcaacaagaa 
aacaaggacg 
tattcgaata 
atgctacccc 
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caaacgagaa 
aaccgcccca 
a r ccacqaqc 
qagaaga 1 1 t 
ccataaacat 
ogccccatgc 
aactctiatct 
atgcatcttg 
ctggaaaaat 
gcatgtaaac 
atgttttaaa 
acagcatacg 
c gagcccacc 
taaatgttct 
acttgtatca 
acacatgcac 
gctaaccatt 
cgtaacctca 
ttccacatcg 
aatatatcca 
aaaaaaagta 
aattttaact 
cctcgcgaga 
tttggatatc 
acgaaagcgc 
aattgtttgt 
gttgcatgca 
cgttaaaaat 
aacaaacatc 
ctgttttgaa 
tgcagaatct 
aacgctgaaa 
ttttgataaa 
at ttcgtgca 
tgattactgt 
catcattcat 
taactttaaa 
actggaaaa t 
aaaaaaacaa 
ccagatgcga 
taaatctccc 
gacaaaatgc 
tcgcttgatt 
catcgacgaa 
ttggatattc 

CGGCttttCC 

attcagcgaa 
atcaaactat 
art ttgtcgt 
ccgcagccat 
teat tgagca 
t caacattga 
t tctaaaaaa 
gt tcaacaac 
cttcaacgca 
gtgagagaaa 
gt tctgaatc 
gagcegtact 
atccagctat 
tcgtccttcg 


tegtgataac 
gtgctcaaaa 
tcaaaaagt t 
gaatgaccag 
cccactccaa 
ccaat tgttg 
ccgtttaatg 
ctccatccgt 
eggatagcat 
cgaaatgact 
tatacctttt 
at gctctagc 
agttggaact 
ttttgtaatt 
attgaacaaa 
attcgccatc 
gaaaaacatc 
acaggttgtt 
agaacteggt 
tttcccacaa 
aatatgaata 
tttccgctga 
agatgaaaat 
ctcgaaccat 
gecaegt tat 
attgtgaaat 
aggttagttg 
gctgagtaat 
gacaaatatt 
attcaaaaag 
accgtatccc 
aagcgattaa 
aacatgagt t 
ggatgetat t 
agatgtttgg 
ttatctttct 
aaat ctggaa 
cttcaaacag 
tcaatattgg 
ccctctattc 
teggggaaaa 
gtttggtctc 
attggcctcc 
aaatcaacaa 
cattgegtat 
tgtcttgtca 
atgaaatcat 
tcaaaaaatc 
ctgaaaacct 
gggaatcagc 
agaggecaat 
gaaactaagg 
eggaaaaect 
aacgtcaaaa 
aaattcaagc 
aegt t tctca 
ggacacatt t 
cgacgaggct 
tttgaaggga 
t tgccgtgag 


tegatgeagt 
gattgttgca 
t tgtcggagt 
tgtagcagag 
t taggtaata 
gagecegtaa 
tatttggtgc 
caaaatgetc 
cccgagaagc 
cttctctgaa 
ctgtgaaact 
aacttgccgc 
aagacgagac 
ttaatgeate 
atagtttgga 
ctgaaacatt 
gcagaatttt 
gtccgtcaat 
ttttctcctg 
actctacgct 
attactaatt 
acattgatgc 
gacgaataaa 
aegcatttta 
tgcggcgctc 
ttcgaagagg 
cctgtttaag 
cagcttttta 
caagaggega 
tattttcaaa 
atacattttt 
atttatttaa 
tcctttggag 
ttcgtggcga 
caacttattt 
gtgtttttag 
aagaaaatta 
gaaaacccac 
tcaagtaaat 
cacttctctg 
ccgtcaaaaa 
gtcacgaatg 
agtgggcaat 
tgaccaactt 
ttttcttttc 
aaatctggat 
gtaatacaat 
aatataatga 
ttttactaat 
gacaacgacg 
gagaaggctg 
aattaaaaca 
tgaaattatg 
gat tatggaa 
ctccaactct 
acat tttcaa 
gaattaaaaa 
cgctcgaatc 
cttgtcatgc 
aaggatcttc 


caacgattga 
acctcgtttt 
atcccagt ct 
ctcgcttttc 
atattcaatc 
attgetcat t 
tttcgaattc 
cgaatccaaa 
ttcttcactt 
ataaactgaa 
gaaacttttc 
tgatgaaccg 
aagttgagtc 
aatttcaata 
cgagttcgat 
aatgtccaat 
gaaaatgeca 
ttcaatgttg 
aactegtaat 
ttgattgtta 
attcgactca 
tgttttgtga 
acaactccaa 
ctttcgaaat 
attttggagg 
cataaaacaa 
cattatcccc 
gaatttaaaa 
atgatategg 
gtattgtccg 
atagtttttc 
atgcatcgtt 
aaaagtaggg 
aacccatact 
ttacataaaa 
ttagttttag 
aaaaattttg 
cgcgttttgc 
aagaaaaaat 
ttcatctget 
aaggcaaact 
gtgagagaga 
gtcggggaaa 
tttgttacag 
caccttcaaa 
ttttgaatat 
tttttatttt 
tttttttttc 
aaaataattt 
ttcagaagca 
aggagatcga 
tttactcctt 
aattcattca 
t t cttcgaga 
ctcaacgctg 
aaacat taat 
catattttca 
tctcccgtat 
aaggacttct 
gtcttgttga 


acaaatgett 
tggaaagtga 
rqccacaqac 
cgcgccaaa t 
acct ttacag 
aaaa tgtatt 
gcaacactga 
gttttcgagt 
gttggcttcg 
caaattaatt 
caggtgeate 
tcagctcgcc 
ggaccgttga 
aaattacagt 
gageccatat 
aaatagttta 
tttttgaaat 
agecegggat 
tgtccgaatg 
attcctgtat 
catcaacctc 
atgatgtgcc 
taattatege 
ccatattttc 
ggaaatcege 
gaaaaeggae 
gcatgtagct 
tattaaactt 
gaatttcgat 
caaggcacat 
cctcagtttt 
cgaataaaat 
atttegcett 
caagagctca 
aegttttcat 
ctagtttttt 
tccctattat 
ttattgetet 
taacgaatct 
gcttcttttg 
aaatgeaaac 
attggcctcc 
accaaactat 
ttttgttata 
aaatctgetc 
ataattttta 
ttccgactgt 
atttttcgeg 
acagggaaaa 
gctccgccac 
tgetaaagee 
taaaactata 
aattgtttca 
agaaggagaa 
gaegtctteg 
cgccttaaaa 
ggctcgtgaa 
ttceggagat 
teaattgetc 
gcaacttttg 


gtgatgetec 
gcatccatgt 
gega tgggaa 
tt tccagagg 
a::ccaaag: 
ttgt tgea t t 
atcctgttaa 
cttttcctga 
aegcaatega 
aattttttta 
agtacaaaca 
taattttgaa 
tttctgaaat 
tggattggaa 
ctgtgacgtc 
ttatgtcgtg 
ccgtcatttt 
aaaaattggt 
aatcaacacc 
cgaactccta 
tttttcccat 
ggaagacgtt 
taatcttcgt 
tattcgaege 
ctgaaaaatc 
atgaaagege 
tgttcggcac 
ttaaaattgc 
tgaaacgaaa 
cacgcaaact 
taaaattaaa 
aaagtttatc 
tcaaaaaatt 
tgegtcttet 
tcattatttc 
tctaaattcc 
ttattttatt 
atttatgaaa 
ctatctgaca 
tttaaccaga 
aegctctata 
geegcagaga 
tgatgagagg 
aatatgagtt 
caacctttaa 
aaaccatcaa 
tgtgtattcc 
attttttatt 
ccactaacga 
atgatggctt 
gaggaagaat 
ctaaaatctc 
gggaegtett 
acaagtcgag 
ttgcttgaag 
t tgaaaacca 
gaccacatcg 
getgetegtt 
gaaaaggaag 
ccagagtgcc 
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ttgacggact tcaaaaggag tggggaagca ccaccaaggt cgttctcgat aaacaaaact 6000 
tcttgccatc ggagtctgct ggaggagttg aactttctgc tcgtgctgga aagatccccg 6060
ggattggcca 6070 

<210> 2 
<211> 3224
<212> DNA

<213> Caenorhabditis elegans


<400> 2 

aagcttcttc 

tgaaataaac 

aactgaaact 

ccgctgatga 

agacaagttg 

caccaatttc 

tggacgagtt 

cattaatgtc 

ttttgaaaat 

caatttcaat 

cctgaactcg 

cgctttgatt 

aattattcga 

acgctgtttt 

taaaacaact 

tttactttcg 

gctcattttg 

gaggcataaa 

taagcattat 

tttagaattt 

gcgaatgata 

caaagtattg 

tttcatagtt 

ttaaatgcat 

ggagaaaagt 

gcgaaaccca 

atttttacat 

ttagttagtt 

attaaaaaat 

ccaccgcgct 

aaataagaaa 

tctgttcatc 

aaaaaaggca 

aatggtgaga 

caatgtcggg 

act ctttgtt 

tttctacttt 

ggatttttga 

caatttttta 

atgatttttt 

taataaaata 

gacgttcaga 

gctgaggaga 

aacatt tact 

t atgaattca 

ggaatccttc 

ctccctcaac 

tcaaaaacat 

aaaacatatt 

aacctctccc 


acttgttggc 
tgaacaaatt 
tttccaggtg 
accgtcagct 
agtcggaccg 
aataaaatta 
cgatgagccc 
caataaatag 
gccatttttg 
gt tgagcccg 
taattgtccg 
gttaattcct 
ctcacatcaa 
gtgaatgatg 
ccaataatta 
aaatccatat 
gaggggaaat 
acaagaaaac 
ccccgcatgt 
aaaatattaa 
tcgggaattt 
tccgcaaggc 
tttccctcag 
cgttcgaata 
agggatttcg 
tactcaagag 
aaaaacgttt 
ttagctagtt 
tttgtcccca 
ttgcttattg 
aaattaacga 
tgctgcttct 
aactaaatgc 
gagaattggc 
gaaaaccaaa 
acagttt tgt 
caaaaaatct 
atatataatt 
ttttttccga 
tttcattttt 
atttacaggg 
agcagccccg 
tcgatgctaa 
cctttaaaac 
ttcaaattgt 
gagaagaagg 
gctggacgtc 
taatcgcctt 
ttcaggctcg 
gtat tt ccgg 


ttcgacgcaa 
aattaatttt 
catcagtaca 
cgcctaattt 
ttgatttctg 
cagttggatt 
atatctgtga 
tctattatgt 
aaatccgtca 
ggataaaaat 
aatgaatcaa 
gtatcgaact 
tctctttttc 
tgccggaaga 
tcgctaatct 
tttctattcg 
ccgcctgaaa 
ggacatgaaa 
agcttgttcg 
acttttaaaa 
cgattgaaac 
acatcacgca 
tttttaaaat 
aaataaagtt 
cct:tcaaaa 
ctcatgcgtc 
tcattcatta 
tttttctaaa 
ttactcattt 
ctgtatttat 
atctctatct 
tttgtttaac 
aaacacgctc 
ctccgccgca 
ctattgatga 
tataaatatg 
gctccaacct 
tttaaaacca 
ctgttgtgta 
cgcgattttt 
aaaaccacca 
ccacatgatg 
agccgaggaa 
tatactaaaa 
ttcagggacg 
agaaacaagt 
ttcgttgctt 
aaaa ttgaaa 
tgaagaccac 
agatgctgct 


tcgagcatgt 
tttaatgttt 
aacaacagca 
tgaatgagcc 
aaattaaatg 
ggaaacttgt 
cgtcacacat 
cgtggctaac 
tttttgtaac 
tggtttccac 
caccaatata 
cctaaaaaaa 
ccataatttt 
cgttcctcgc 
tcgttttgga 
acgcacgaaa 
aatcaattgt 
gcgcgttgca 
gcaccgttaa 
ttgcaacaaa 
gaaactgttt 
aacttgcaga 
taaaaacqct 
tatcttttga 
aattatttcg 
ttcttgatta 
tttccatcat 
ttcctaactt 
tattactgga 
gaaaaaaaaa 
gacaccagat 
cagataaatc 
tatagacaaa 
gagatcgctt 
gaggtatcga 
agttttggat 
t taatggctt 
ccaaattcag 
ttccatcaaa 
tattattttg 
acgactgtag 
gctttcattg 
gaattcaaca 
tctcttctaa 
tcttgt tcaa 
cgagcttcaa 
gaaggtgaga 
accagttctg 
a tcggagccg 
cgtcatccag 


aaaccgaaat 
taaatatacc 
tacgacgctc 
caccagttgg 
ttctttttgt 
atcaattgaa 
gcacattcgc 
cattgaaaaa 
ctcaacaggt 
atcgagaact 
tccatttccc 
agtaaatatg 
aacttttccg 
gagaagatga 
tatcgtcgaa 
gtgcgccacg 
ttgtattgtg 
tgcaaggtta 
aaatgctgag 
catcgacaaa 
tgaaattcaa 
atctaccgta 
gaaaaagcga 
taaaaacatg 
tgcaggatgc 
ctgtagatgt 
tcatttatct 
taaaaaatct 
aaatcttcaa 
acaatcaata 
gcgaccctct 
tccctcgggg 
atgtgtttgg 
gattattggc 
cgaaaaatca 
attccattgc 
ttcctgtctt 
cgaaacgaaa 
ctattcaaaa 
tcgtctgaaa 
ccatgggaat 
agcaagaggc 
t tgagaaagt 
aaaacggaaa 
caacaacgtc 
cgcaaaattc 
gaaaacgttc 
aatcggacac 
tactcgacga 
ctattttgaa 


gactcttctc 
ttttctgtga 
tagcaacttg 
aactaagacg 
aattttaatg 
caaaatagtt 
catcctgaaa 
catcgcagaa 
tgttgtccgt 
cggtttttct 
acaaactcta 
aataattact 
ctgaacattg 
aaatgacgaa 
ccatacgcat 
ttattgcggc 
aaatttcgaa 
gttgcctgtt 
taatcagctt 
tattcaagag 
aaagtatttt 
tcccatacat 
ttaaatttat 
agtttccttt 
tattttcgtg 
ttggcaactt 
ttctgtgttt 
ggaaaagaaa 
acaggaaaac 
ttggtcaagt 
attccacttc 
aaaaccgtca 
tctcgtcacg 
ctccagtggg 
acaatgacca 
gtacttttct 
gtcaaaatcc 
tcatgtaata 
aatcaatata 
acctttttac 
cagcgacaac 
caatgagaag 
aaggaattaa 
accttgaaat 
aaaagat tat 
aagcctccaa 
ctcaacattt 
at ttgaatta 
ggctcgctcg 
gggact tgtc 
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atgcaaggac tccctcaatt gctcgaaaag gaagccgtcc ttcgttgccg tgagaaggat 3060 

cttcgtcttg ttgagcaact tttgccagag cgccttgacg gacttcaaaa ggagtgggga 3120 

agcaccacca aggtcgttct cgataaacaa aacttcttgc catcggagtc tgctggagga 3180

gtcgaacctt ctgctcgtgc tggaaagatc cccgggattg gcca 3224 

<210> 3 
<211> 2167 
<212> DNA 

<213> Caenorhabditis elegans 


<400> 3 

gcatgcaagg 

taaaaatgct 

aaacatcgac 

ttttgaaatt 

agaatctacc 

gctgaaaaag 

tgataaaaac 

tcgtgcagga 

ttactgtaga 

cattcattta 

ctttaaaaaa 

ggaaaatctt 

aaaacaatca 

gatgcgaccc 

atctccctcg 

aaaatgtgtt 

cttgattatt 

cgacgaaaaa 

gatattccat 

cttttcctgt 

cagcgaaatg 

aaactattca 

ttgtcgtctg 

tagccatggg 

ttgagcaaga 

acattgagaa 

taaaaaacgg 

caacaacaac 

caacgcaaaa 

agagaaaacg 

ctgaaccgga 

ccgtactcga 

cagctatttt 

tccttcgttg 

acggact tea 

tgccatcgga 

ttggcca 


ttagttgcct 
gagtaatcag 
aaatattcaa 
caaaaagtat 
gtatcccata 
cgattaaatt 
atgagtttcc 
tgetatttte 
tgtttggcaa 
tctttctgtg 
tctggaaaag 
caaacaggaa 
atattggtca 
tctattccac 
gggaaaaccg 
tggtctcgtc 
ggcctccagt 
tcaacaatga 
tgcgtatttt 
cttgtcaaaa 
aaatcatgta 
aaaaatcaat 
aaaacctttt 
aatcagegae 
ggecaatgag 
agtaaggaat 
aaaaccttga 
gtcaaaagat 
ttcaagcctc 
tttctcaaca 
cacatttgaa 
cgaggctcgc 
gaagggact t 
ccgtgagaag 
aaaggagtgg 
gtctgctgga 


gtttaagcat 
ctttttagaa 
gaggegaatg 
tttcaaagta 
catttttata 
tatttaaatg 
tttggagaaa 
gtggcgaaac 
cttattttta 
tttttagtta 
aaaartaaaa 
aacccaccgc 
agtaaataag 
ttctctgttc 
tcaaaaaaag 
acgaatggtg 
gggcaatgtc 
ccaacttttt 
tcttttctac 
tctggatttt 
atacaatttt 
ataatgattt 
tactaataaa 
aacgaegtte 
aaggctgagg 
taaaacattt 
aattatgaat 
tatggaattc 
caactctctc 
ttttcaaaaa 
ttaaaaacat 
tcgaatctct 
gtcatgcaag 
gatcttegtc 
ggaagcacca 
ggagttgaac 


tatccccgca 
tttaaaatat 
atategggaa 
ttgtccgcaa 
gtttttccct 
categttega 
agtagggatt 
ccatactcaa 
cataaaaacg 
gttttagcta 
aattttgtcc 
gttttgetta 
aaaaaattaa 
atetgetget 
gcaaactaaa 
agagagaatt 
ggggaaaacc 
gttacagttt 
tttcaaaaaa 
tgaatatata 
ttattttttc 
ttttttcatt 
ataatttaca 
agaagcagct 
agatcgatgc 
acccctttaa 
tcattcaaat 
ttcgagaaga 
aacgetggae 
cattaatege 
attttcaggc 
cccgtatttc 
gacttcttca 
ttgttgagca 
ecaaggtegt 
tttctgeteg 


tgtagcttgt 
taaactttta 
tttcgattga 
ggcacatcac 
cagtttttaa 
ataaaataaa 
tcgcctttca 
gagctcatgc 
ttttcattca 
gtttttttct 
ctattattta 
ttgctgtatt 
cgaatctcta 
tcttttgttt 
tgcaaacacg 
ggcctccgcc 
aaactattga 
tgttataaat 
tctgctccaa 
atttttaaaa 
cgactgttgt 
tttcgegatt 
gggaaaacca 
ccgccacatg 
taaagecgag 
aactatacta 
tgtttcaggg 
aggagaaaca 
gtcttcgctg 
ct taaaattg 
tegtgaagae 
eggagatget 
attgetcgaa 
acttttgeca 
tctcgataaa 
tgctggaaag 


tcggcaccgt 

aaattgeaac 

aacgaaactg 

geaaacttge 

aattaaaaac 

gtttatcttt 

aaaaattatt 

gtcttcttga 

ttatttccat 

aaattcctaa 

ttttattact 

tatgaaaaaa 

tctgacacca 

aaccagataa 

ctctatagac 

gcagagatcg 

tgagaggtat 

atgagttttg 

cctttaatgg 

ccat caaatt 

gtattccatc 

ttttattatt 

ctaacgactg 

atggctttca 

gaagaattca 

aaatctcttc 

acgtcttgtt 

agecgagett 

cttgaaggtg 

aaaaccagtt 

cacateggag 

getegttate 

aaggaagtcg 

gagtgccttg 

caaaacttct 

atccccggga 


60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2167 


<210> 4 

<211> 1308 

<212> DNA 

<213> Caenorhabditis elegans 


<400> 4 

gcatgcaagg ttagttgcct gtttaagcat tatccccgca tgtagcttgt tcggcaccgt 60
taaaaatgct gagtaatcag ctttttagaa tttaaaatat taaactttta aaattgcaac 120
aaacatcgac aaatattcaa gaggcgaatg atatcgggaa tttcgattga aacgaaactg 180
ttttgaaatt caaaaagtat tttcaaagta ttgtccgcaa ggcacatcac gcaaacttgc 240
agaatctacc gtatcccata catttttata gtttttccct cagtttttaa aattaaaaac 300
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gctgaaaaag cgattaaatt tatttaaatg catcgttcga ataaaataaa gtttatcttt 360 
tgataaaaac atgagtttcc tttggagaaa agtagggatt tcgcctttca aaaaattatt 420 
tcgtgcagga tgctattttc gtggcgaaac ccatactcaa gagctcatgc gtcttcttga 480
ttactgtaga tgtttggcaa cttattttta cataaaaacg ttttcattca ttatttccat 540
cattcattta tctttctgtg tttttagtta gttttagcta gtttttttct aaattcctaa 600
ctttaaaaaa tctggaaaag aaaattaaaa aattttgtcc ctattattta ttttattact 660
ggaaaatctt caaacaggaa aacccaccgc gttttgctta ttgctgtatt tatgaaaaaa 720 
aaaacaatca atattggtca agtaaataag aaaaaattaa cgaatctcta tctgacacca 780 
gatgcgaccc tctattccac ttctctgttc atctgctgct tcttttgttt aaccagataa 840 
atctccctcg gggaaaaccg tcaaaaaaag gcaaactaaa tgcaaacacg ctctatagac 900 
aaaatgtgtt tggtctcgtc acgaatggtg agagagaatt ggcctccgcc gcagagatcg 960 
cttgattatt ggcctccagt gggcaatgtc ggggaaaacc aaactattga tgagaggtat 1020
cgacgaaaaa tcaacaatga ccaacttttt gttacagttt tgttataaat atgagttttg 1080 
gatattccat tgcgtatttt tcttttctac tttcaaaaaa tctgctccaa cctttaatgg 1140 
cttttcctgt cttgtcaaaa tctggatttt tgaatatata atttttaaaa ccatcaaatt 1200 
cagcgaaatg aaatcatgta atacaatttt ttattttttc cgactgttgt gtattccatc 1260 
aaactattca aaaaatcaat ataatgattt ttttttcatt tttcgcga 1308 


<210> 5 

<211> 1707 

<212> DNA 

<213> Caenorhabditis elegans 


<400> 5 

gagctcatgc gtcttcttga ttactgtaga tgtttggcaa cttattttta cataaaaacg 60
ttttcattca ttatttccat cattcattta tctttctgtg tttttagtta gttttagcta 120
gtttttttct aaattcctaa ctttaaaaaa tctggaaaag aaaattaaaa aattttgtcc 180
ctattattta ttttattact ggaaaatctt caaacaggaa aacccaccgc gttttgctta 240
ttgctgtatt tatgaaaaaa aaaacaatca atattggtca agtaaataag aaaaaattaa 300
cgaatctcta tctgacacca gatgcgaccc tctattccac ttctctgttc atctgctgct 360
tcttttgttt aaccagataa atctccctcg gggaaaaccg tcaaaaaaag gcaaactaaa 420
tgcaaacacg ctctatagac aaaatgtgtt tggtctcgtc acgaatggtg agagagaatt 480
ggcctccgcc gcagagatcg cttgattatt ggcctccagt gggcaatgtc ggggaaaacc 540
aaactattga tgagaggtat cgacgaaaaa tcaacaatga ccaacttttt gttacagttt 600
tgttataaat atgagttttg gatattccat tgcgtatttt tcttttctac tttcaaaaaa 660
tctgctccaa cctttaatgg cttttcctgt cttgtcaaaa tctggatttt tgaatatata 720
atttttaaaa ccatcaaatt cagcgaaatg aaatcatgta atacaatttt ttattttttc 780
cgactgttgt gtattccatc aaactattca aaaaatcaat ataatgattt ttttttcatt 840
tttcgcgatt ttttattatt ttgtcgtctg aaaacctttt tactaataaa ataatttaca 900
gggaaaacca ctaacgactg tagccatggg aatcagcgac aacgacgttc agaagcagct 960
ccgccacatg atggctttca ttgagcaaga ggccaatgag aaggctgagg agatcgatgc 1020
taaagccgag gaagaattca acattgagaa agtaaggaat taaaacattt actcctttaa 1080
aactatacta aaatctcttc taaaaaacgg aaaaccttga aattatgaat tcattcaaat 1140
tgtttcaggg acgtcttgtt caacaacaac gtcaaaagat tatggaattc ttcgagaaga 1200
aggagaaaca agtcgagctt caacgcaaaa ttcaagcctc caactctctc aacgctggac 1260
gtcttcgttg cttgaaggtg agagaaaacg tttctcaaca ttttcaaaaa cattaatcgc 1320
cttaaaattg aaaaccagtt ctgaatcgga cacatttgaa ttaaaaacat attttcaggc 1380
tcgtgaagac cacatcggag ccgtactcga cgaggctcgc tcgaatctct cccgtatttc 1440
cggagatgct gctcgttatc cagctatttt gaagggactt gtcatgcaag gacttcttca 1500
attgctcgaa aaggaagtcg tccttcgttg ccgtgagaag gatcttcgtc ttgttgagca 1560
acttttgcca gagtgccttg acggacttca aaaggagtgg ggaagcacca ccaaggtcgt 1620
tctcgataaa caaaacttct tgccatcgga gtctgctgga ggagttgaac tttctgctcg 1680
tgctggaaag atccccggga ttggcca 1707


<210> 6 

<211> 789 

<212> DNA 

<213> Caenorhabditis elegans 


<400> 6 
ttttcattca ttatttccat cattcattta tctttctgtg tttttagtta gttttagcta 60
gtttttttct aaattcctaa ctttaaaaaa tctggaaaag aaaattaaaa aattttgtcc 120
ctattattta ttttattact ggaaaatctt caaacaggaa aacccaccgc gttttgctta 180
ttgctgtatt tatgaaaaaa aaaacaatca atattggtca agtaaataag aaaaaattaa 240
cgaatctcta tctgacacca gatgcgaccc tctattccac ttctctgttc atctgctgct 300
tcttttgttt aaccagataa atctccctcg gggaaaaccg tcaaaaaaag gcaaactaaa 360
tgcaaacacg ctctatagac aaaatgtgtt tggtctcgtc acgaatggtg agagagaatt 420
ggcctccgcc gcagagatcg cttgattatt ggcctccagt gggcaatgtc ggggaaaacc 480
aaactattga tgagaggtat cgacgaaaaa tcaacaatga ccaacttttt gttacagttt 540
tgttataaat atgagttttg gatattccat tgcgtatttt tcttttctac tttcaaaaaa 600
tctgctccaa cctttaatgg cttttcctgt cttgtcaaaa tctggatttt tgaatatata 660
atttttaaaa ccatcaaatt cagcgaaatg aaatcatgta atacaatttt ttattttttc 720
cgactgttgt gtattccatc aaactattca aaaaatcaat ataatgattt ttttttcatt 780
tttcgcgat 789


<210> 7
<211> 397
<212> DNA

<213> Caenorhabditis elegans


<400> 7

ggtctcgtca cgaatggtga gagagaattg gcctccgccg cagagatcgc ttgattattg 60
gcctccagtg ggcaatgtcg gggaaaacca aactattgat gagaggtatc gacgaaaaat 120
caacaatgac caactttttg ttacagtttt gttataaata tgagttttgg atattccatt 180
gcgtattttt cttttctact ttcaaaaaat ctgctccaac ctttaatggc ttttcctgtc 240
ttgtcaaaat ctggattttt gaatatataa tttttaaaac catcaaattc agcgaaatga 300
aatcatgtaa tacaattttt tattttttcc gactgttgtg tattccatca aactattcaa 360
aaaatcaata taatgatttt tttttcattt ttcgcga 397


<210> 8 
<211> 6202 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
pGF2006 


<400> 8 

agettgeatg 

accgttaaaa 

gcaacaaaca 

aactgttttg 

ettgeagaat 

aaaacgctga 

tcttttgata 

ttatttcgtg 

cttgattact 

tccatcattc 

cctaacttta 

ttactggaaa 

aaaaaaaaac 

caccagatgc 

gataaatctc 

tagacaaaat 

gategcttga 

ggtatcgacg 

ttttggatat 

aatggctttt 

aaattcagcg 


caaggttagt 
atgctgagta 
tcgacaaata 
aaattcaaaa 
ctaccgtatc 
aaaagegatt 
aaaacatgag 
caggatgeta 
gtagatgttt 
atttatcttt 
aaaaatctgg 
atcttcaaac 
aatcaatatt 
gaccctctat 
ecteggggaa 
gtgtttggtc 
ttattggcct 
aaaaatcaac 
tecattgegt 
cctgtcttgt 
aaatgaaatc 


tgcctgttta 
atcagctttt 
ttcaagaggc 
agtattttca 
ccatacattt 
aaatttattt 
tttcctttgg 
ttttcgtggc 
ggcaacttat 
ctgtgttttt 
aaaagaaaat 
aggaaaaccc 
ggtcaagtaa 
tccacttctc 
aacegtcaaa 
tegtcacgaa 
ccagtgggca 
aatgaccaac 
atttttcttt 
caaaatctgg 
atgtaataca 


agcattatcc 
tagaatttaa 
gaatgatatc 
aagtattgtc 
ttatagtrtt 
aaatgeateg 
agaaaagtag 
gaaacccata 
ttttacataa 
agttagtttt 
taaaaaattt 
accgcgttt t 
ataagaaaaa 
tgttcatctg 
aaaaggcaaa 
tggtgagaga 
atgtcgggga 
tttttgttac 
tctactttca 
atttttgaat 
attttttatt 


ccgcatgtag 
aatattaaac 
gggaatttcg 
cgcaaggcac 
tccctcagtt 
ttcgaataaa 
ggatttcgee 
ctcaagagct 
aaacgttttc 
agctagtttt 
tgtccctatt 
gettattget 
attaacgaat 
ctgcttcttt 
etaaatgeaa 
gaattggcct 
aaaccaaact 
agttttgtta 
aaaaatctgc 
atataatttt 
ttttccgact 


cttgttcggc 
ttttaaaatt 
attgaaacga 
ateaegcaaa 
tttaaaatta 
ataaagttta 
tttcaaaaaa 
catgegtett 
attcattatt 
tttctaaatt 
atttatttta 
gtatttatga 
ctctatctga 
tgtttaacca 
acacgctcta 
ccgccgcaga 
attgatgaga 
taaacatgag 
tccaaccttt 
taaaaccatc 
gttgtgtatt 


60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 




ccatcaaact atccaaaaaa tcaatataat gatttttttt tcatttttcg gactctagag 1320
gatccccggg gattggccaa aggacccaaa ggtatgtttc gaatgatact aacataacac 1380
agaacatttt caggaggacc cttgcttggc tagcaaaaat gcataaggtt ttgctggcac 1440
tgttctttat cn-tctggca ccagcatccg cactggcagt ctccgaaccg gcccgcagga 1500
ccgacttttt gcaaattacg agcgttgtag ggggcggacg ataggtccca taggcnrtgr. 1560
acatcatcat ccattcactc attggtacat tcatctaccc accttcctct ttctgagctit 1620
ctctggagtt ccgcgctccc tttttccctt acctttacac cgtaatttct aactttcagg 1680
cattgattgg atccccggga ttggccaaag gacccaaagg tatgtttcga acgatactaa 1740
cacaacatag aacattttca ggaggaccct tgcttggagg gtaccgagct cagaaaaaat 1800
gactgcccca aagaagaagc gtaaggtacc ggtagaaaaa atgagtaaag gagaagaact 1860
tttcactgga gttgtcccaa ttcttgttga attagatggt gatgttaatg ggcacaaatt 1920
ctctgtcagt ggagagggtg aaggtgatgc aacatacgga aaacttaccc ttaaatttat 1980
ttgcactact ggaaaactac ctgttccatg ggtaagttta aacatatata tactaactaa 2040
cccrgattat ttaaattttc agccaacact tgtcactact ttctgttatg gtgttcaatg 2100
cttcccgaga cacccagacc atatgaaacg gcatgacttt ttcaagagtg ccatgcccga 2160
aggttatgca caggaaagaa ctatattttt caaagatgac gggaactaca agacacgtaa 2220
ctttaaacag ttcggtacta actaaccata catatttaaa ttttcaggtg ctgaagtcaa 2280
gtttgaaggt gatacccttg ttaatagaat cgagttaaaa ggtattgatt ttaaagaaga 2340
cggaaacatc cttggacaca aattggaata caactataac tcacacaatg tatacatcat 2400
ggcagacaaa caaaagaatg gaatcaaagt tgtaagttta aacatgattt tactaactaa 2460
ctaatccgat ctaaaitttc agaacttcaa aattagacac aacattgaag atggaagcgt 2520
ccaactagca gaccattatc aacaaaatac tccaattggc gatggccctg tccttttacc 2580
agacaaccat caccrgtcca cacaatctgc cctttcgaaa gatcccaacg aaaagagaga 2640
ccacatggtc cctcttgagt ttgtaacagc tgctgggatc acacatggca tggatgaact 2700
atacaaatag cactcgtaga attccaactg agcgccggtc gctaccatta ccaacttgtc 2760
cggtgtcaaa aataataggg gccgctgtca tcagagtaag tttaaactga gttctactaa 2820
ccaacgagta atatttaaat tttcagcatc tcgcgcccgt gcctctgact tctaagtcca 2880
attactcttc aacatcccta catgctcttt ctccctgtgc tcccaccccc tatttttgtt 2940
attatcaaaa aaacttcttc ttaatttctt tgttttttag cttcttttaa gtcacctcta 3000
acaatgaaat tgtgtagatt caaaaataga attaattcgt aataaaaagt cgaaaaaaat 3060
tgtgctccct ccccccatta ataataattc tatcccaaaa tctacacaat gttctgtgta 3120
cacttcttat gtttttttta cttctgataa attttttttg aaacatcata gaaaaaaccg 3180
cacacaaaat accttatcat atgttacgtt tcagtttatg accgcaattt ttatttcttc 3240
gcacctctgg gcctctcatg acgtcaaatc atgctcatcg tgaaaaagtt ttggagtatt 3300
tttggaactt ttcaaccaag tgaaagttta tgaaattaat tttcctgctt ttgctttttg 3360
ggggcttccc ctattgtttg tcaagagttt cgaggacggc gtttttcttg ctaaaatcac 3420
aagtattgat gagcacgatg caagaaagat cggaagaagg tttgggtttg aggctcagtg 3480
gaaggtgagc agaagttgat aatttgaaag tggagtagtg tctatggggt ttttgcctta 3540
aatgacagaa cacattccca atataccaaa cataactgtt tcctactagt cggccgtacg 3600
ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac atgcagctcc 3660
cggagacggt cacagcttgt ctgtaagcgg atgccgggag cagacaagcc cgtcagggcg 3720
cgtcagcggg tgttggcggg tgtcggggct ggcttaacta tgcggcatca gagcagattg 3780
tactgagagt gcaccatatg cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc 3840
gcatcaggcg gccttaaggg cctcgtgata cgcctatttt taraggttaa tgtcatgata 3900
ataatggttt cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt 3960
tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa 4020
acgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt 4080
attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggngaaa 4140
gtaaaagacg ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac 4200
agcggtaaga cccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt 4260
aaagttctgc tacgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt 4320
cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat 4380
cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac 4440
actgcggcca acctacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg 4500
cacaacatgg gggatcatgt aacccgcctt gatcgttggg aaccggagct gaatgaagcc 4560
ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa 4620
ctattaactg gcgaactact tactctagct tcccggcaac aattaataga ctggatggag 4680
gcggacaaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct 4740
gacaaatccg gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat 4800
ggtaagccct cccgtatcgc agtcatctac acgacgggga gtcaggcaac tatggatgaa 4860
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cgaaacagac 
caagtttact 
taggtgaaga 
caccgagcg: 
cgcgtaacct 
gaccaagagc 
aatactgtcc 
cctacatacc 
tgtcttaccg 
acggggggtt 
ctacagcgtg 
ccggtaagcg 
tggtatcttt 
tgctcgtcag 
ctggcctttt 
gataaccgta 
cgcagcgagt 
gcgcgttggc 
agtgagcgca 
tttatgcttc 
aacagctatg 
attctcattt 
ctaacaactt 


agatcgctga 
cacacatact 
cccc c t ttga 
cagaccccgc 
gccgct tgca 
taccaactct 
tzctagtgta 
tcgctctgct 
ggttggactc 
cgtgcacaca 
agcattgaga 
gcagggtcgg 
atagtcctgt 
gggggcggag 
gctggccttt 
ttaccgcctt 
cagtgagcga 
cgattcatta 
acgcaattaa 
cggctcgtat 
accatgatta 
aaattttcag 
ggaaatgaaa 


gataggtgcc 
ttagattgat 
taatctcatg 
agaaaagatc 
aacaaaaaaa 
t tttccgaag 
gccgtagtta 
aatcctgtta 
aagacgatag 
gcccagcttg 
aagcgccacg 
aacaggagag 
cgggtttcgc 
cctatggaaa 
tgctcacatg 
tgagtgagct 
ggaagcggaa 
atgcagctgg 
tgtgagttag 
gttgtgtgga 
cgccaagctg 
agcttaaaaa 
ta 


tcactgatta 
ttaaaacttc 
accaaaatcc 
aaaggatctt 
ccaccgctac 
gcaactggct 
ggccaccact 
ccagtggctg 
ttaccggata 
gagcgaacga 
cttcccgaag 
cgcacgaggg 
cacctctgac 
aacgccagca 
ttctttcctg 
gataccgctc 
gagcgcccaa 
cacgacaggt 
ctcactcatt 
attgtgagcg 
taagtttaaa 
tggctgaaat 


agcattggta 
atttttaatt 
ct taacgtga 
cttgagatcc 
cagcggtggt 
tcagcagagc 
tcaagaactc 
ctgccagtgg 
aggcgcagcg 
cctacaccga 
ggagaaaggc 
agcttccagg 
ttgagcgtcg 
acgcggcctt 
cgttatcccc 
gccgcagccg 
tacgcaaacc 
ttcccgactg 
aggcacccca 
gataacaatt 
catgatctta 
cactcacaac 


accgtcagac 
taaaaggatc 
gttttcgttc 
ctttt ttctg 
t tgt »: tgccg 
gcaga caeca 
tgtagcaccg 
cgacaagtcg 
gtcgggctga 
actgagatac 
ggacaggtat 
gggaaacgee 
atttttgtga 
tttacggttc 
tgattctgtg 
aacgaccgag 
gcctctcccc 
gaaageggge 
ggctttacac 
tcacacagga 
ctaactaact 
gatggatacg 


4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6202 


<210> 9 
<211> 5737 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
pGF2009 


plasmid 


<400> 9 

accctctatt ccacttctct gttcatctgc tgcttctttt gtttaaccag ataaatctcc 60 
ctcggggaaa accgtcaaaa aaaggcaaac taaatgcaaa cacgctctat agacaaaatg 120 
tgtttggtct cgtcacgaat ggtgagagag aattggcctc cgccgcagag atcgcttgat 180 
tattggcctc cagtgggcaa tgtcggggaa aaccaaacta ttgatgagag gtatcgacga 240 
aaaatcaaca atgaccaact ttttgttaca gttttgttat aaatatgagt tttggatatt 300 
ccattgcgta tttttctttt ctactttcaa aaaatctgct ccaaccttta atggcttttc 360 
ctgtcttgtc aaaatctgga tttttgaata tataattttt aaaaccatca aattcagcga 420 
aatgaaatca tgtaatacaa ttttttattt tttccgactg ttgtgtattc catcaaacta 480 
ttcaaaaaat caatataatg attttttttt catttttcgg actctagagg atccccgggg 540 
attggccaaa ggacccaaag gtatgtttcg aatgatacta acataacata gaacattttc 600 
aggaggaccc ttgcttggct agcaaaaatg cataaggttt tgctggcact gttctttatc 660 
tttctggcac cagcatccgc actggcagtc tccgaaccgg cctgcaggat cgattttttg 720 
caaattacga gcgttgtagg gggcggacga taggtcctat aggttttgta tatcatcatt 780 
cattcattca ttggtacatt catttaccca ccttcctctt tctgagcttc tctggagttc 840 
tgtgcttcct ttttccctta tctttatact gtaattttta actttcaggc attgattgga 900 
tccccgggat tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga 960 
acattttcag gaggaccctt gcttggaggg taccgagctc agaaaaaatg actgctccaa 1020 
agaagaagcg taaggtaccg gtagaaaaaa tgagtaaagg agaagaactt ttcactggag 1080 
ctgtcccaat tcttgttgaa ttagatggtg atgttaatgg gcacaaattt tctgtcagtg 1140 
gagagggtga aggtgatgca acatacggaa aacttaccct taaatttatt tgcactactg 1200 
gaaaactacc tgttccatgg gtaagtttaa acatatatat actaactaac cctgattatt 1260 
caaattttca gccaacactt gtcactactt tctgttatgg tgttcaacgc ttctcgagat 1320 
acccagatca tatgaaacgg catgactttt tcaagagtgc catgcccgaa ggttatgtac 1380 
aggaaagaac tatatttttc aaagatgacg ggaactacaa gacacgtaag tttaaacagt 1440 
ccggtactaa ctaaccatac atatttaaac tttcaggtgc tgaagtcaag tttgaaggtg 1500 
atacccttgt taatagaatc gagttaaaag gtattgattt taaagaagat ggaaacattc 1560 
ttggacacaa attggaatac aactataacc cacacaatgt atacatcatg gcagacaaac 1620 
aaaagaatgg aatcaaagct gtaagtttaa acacgacttt actaactaac taatctgatt 1680 
taaatttcca gaacttcaaa attagacaca acattgaaga tggaagcgtt caactagcag 1740 
accattatca acaaaatact ccaaccggcg atggccctgt ccttttacca gacaaccatt 1800 
acccgtccac acaatctgcc ctttcgaaag atcccaacga aaagagagac cacatggtcc 1860 
ttcttgagtt tgtaacagcc gccgggacca cacatggcac ggatgaacta cacaaatagc 1920 
attcgtagaa ctccaactga gcgccggtcg ctaccattac caacttgcct ggtgtcaaaa 1980 
ataatagggg ccgctgtcat cagagtaagt ttaaactgag ttctactaac taacgagtaa 2040 
tatttaaatt ttcagcatct cgcgcccgtg cctctgactt ctaagtccaa ttactcttca 2100 
acatccctac atgctctttc tccctgtgct cccaccccct atttttgtta ttatcaaaaa 2160 
aacttcttct taatttcttt gttttttagc ttcttttaag tcacctctaa caatgaaatt 2220 
gcgtagattc aaaaatagaa ttaattcgta ataaaaagtc gaaaaaaatt gtgctccctc 2280 
cccccattaa taataattct atcccaaaat ctacacaatg ttctgtgtac acttcttatg 2340 
ttttttttac ttctgataaa ttttttttga aacatcatag aaaaaaccgc acacaaaata 2400 
ccttatcata tgttacgttt cagtttatga ccgcaatttt tatttcttcg cacgtctggg 2460 
cctctcatga cgtcaaatca tgctcatcgt gaaaaagttt tggagtattt ttggaatttt 2520 
tcaatcaagt gaaagtctat gaaattaatt ttcctgcttt tgctttttgg gggtttcccc 2580 
tattgtttgt caagagtttc gaggacggcg tttttcttgc taaaatcaca agtattgatg 2640 
agcacgatgc aagaaagatc ggaagaaggt ttgggtttga ggctcagtgg aaggtgagta 2700 
gaagttgata atttgaaagt ggagtagtgt ctatggggtt tttgccttaa atgacagaat 2760 
acattcccaa tataccaaac ataactgttt cctactagtc ggccgtacgg gccctttcgt 2820 
ctcgcgcgtt tcggtgatga cggtgaaaac ctctgacaca tgcagctccc ggagacggtc 2880 
acagcttgtc cgtaagcgga tgccgggagc agacaagccc gtcagggcgc gtcagcgggt 2940 
gttggcgggt gtcggggctg gcttaactat gcggcatcag agcagattgt actgagagtg 3000 
caccatatgc ggtgtgaaat accgcacaga tgcgtaagga gaaaataccg catcaggcgg 3060 
ccttaagggc ctcgtgatac gcctattttt ataggttaat gtcatgataa taatggtttc 3120 
ttagacgtca ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt 3180 
ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata 3240 
atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 3300 
tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 3360 
tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 3420 
ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 3480 
atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 3540 
ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 3600 
catgacagta agagaactat gcagtgccgc cataaccatg agtgataaca ctgcggccaa 3660 
cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 3720 
ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 3780 
cgagcgtgac accacgatgc ctgtagcaac ggcaacaacg ttgcgcaaac tattaactgg 3840 
cgaactactt accctagctt cccggcaaca attaatagac tggatggagg cggacaaagt 3900 
tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg acaaatctgg 3960 
agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 4020 
ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 4080 
gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 4140 
atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 4200 
cctttttgat aacctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 4260 
agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 4320 
ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 4380 
accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct 4440 
tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 4500 
cgctctgcta accctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 4560 
gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 4620 
gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 4680 
gcattgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 4740 
cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgccc ggtatcttta 4800 
tagtcctgtc gggtctcgcc acccctgact tgagcgtcga tttttgtgat gctcgtcagg 4860 
ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggtccc tggccttttg 4920 
ctggcctttt gctcacacgt tctcccctgc gttatcccct gattctgtgg acaaccgtat 4980 
caccgccttt gagtgagctg acaccgctcg ccgcagccga acgaccgagc gcagcgagtc 5040 
agcgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 5100 
gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 5160 
cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 5220 

ggctcgtatg ttgtgtggaa ctgtgagcgg ataacaactt cacacaggaa acagctatga 5280 

ccacgattac gccaagctgt aagtttaaac atgaccttac caactaacta ttctcatcta 5340 

aatttccaga gcttaaaaac ggctgaaatc actcacaacg atggatacgc taacaactcg 5400 

gaaatgaaat aagctccatg cgtcttcttg attaccgcag atgtttggca actcatcttt 5460 

acacaaaaac gtcttcattc attatttcca tcattcattt atctttctgt gtttttagtt 5520 

agttttagct agtttctttc taaattccta actttaaaaa atctggaaaa gaaaatcaaa 5580 

aaactccgtc cccattattt atcttattac tggaaaatct ccaaacagga aaacccaccg 5640 

cgtcttgcct attgctgtat ttatgaaaaa aaaaacaatc aatattggtc aagtaaataa 5700 
gaaaaaatta acgaatctct atctgacacc agatgcg 5737 

<210> 10 
<211> 6157 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
pGF2013 

<400> 10 

ctcacccaga aaccctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg 60 

gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac 120 

gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg 180 

acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt 240 

actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg 300 

ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac 360 

cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt 420 

gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag 480 

caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 540 

aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 600 

ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 660 

tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 720 

ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga 780 

ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac 840 

ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa 900 

tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat 960 

cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 1020 

taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 1080 

gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc 1140 

acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg 1200 

ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg 1260 

ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 1320 

cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg 1380 

aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga 1440 

gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct 1500 

gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca 1560 

gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc 1620 

ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg 1680 

ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc 1740 

caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaacgcagc tggcacgaca 1800 

ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc 1860 

attaggcacc ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga 1920 

gcggataaca atttcacaca ggaaacagct atgaccatga ttacgccaag ctgtaagttt 1980 

aaacatgatc ttactaacta actattctca tttaaatttt cagagcttaa aaatggctga 2040 

aatcactcac aacgatggat acgctaacaa cttggaaatg aaataagctt gcatgcaagg 2100 

ttagttgcct gtttaagcat tatccccgca tgtagcttgt tcggcaccgt taaaaatgct 2160 

gagtaatcag ctttttagaa tttaaaatat taaactttta aaattgcaac aaacatcgac 2220 

aaatattcaa gaggcgaatg atatcgggaa tttcgattga aacgaaactg ttttgaaatt 2280 

caaaaagtat tttcaaagta ttgtccgcaa ggcacatcac gcaaacttgc agaatctacc 2340 



gtatcccata catttttata gtttttccct cagttttcaa aattaaaaac gctgaaaaag 2400 
cgattaaatt tatttaaatg catcgttcga ataaaataaa gtttatcttt tgataaaaac 2460 
atgagtttcc tttggagaaa agtagggatt tcgcctttca aaaaattatt tcgtgcagga 2520 
tgctattttc gtggcgaaac ccatactcaa gagctcatgc gtcttcttga ttactgtaga 2580 
tgtttggcaa cttattccta cataaaaacg ttttcattca ttatttccat cattcattta 2640 
tccttctgtg tttttagtta gttttagcta gtttttttct aaattcctaa ctttaaaaaa 2700 
tctggaaaag aaaactaaaa aattttgtcc ctattattta ttttattact ggaaaatctt 2760 
caaacaggaa aacccaccgc gttttgctta ttgctgtatt tatgaaaaaa aaaacaatca 2820 
atattggtca agtaaataag aaaaaattaa cgaatctcca tctgacacca gatgcgaccc 2880 
tctatcccac ttctctgttc atctgctgct tcttttgttt aaccagataa atctccctcg 2940 
gggaaaaccg tcaaaaaaag gcaaactaaa tgcaaacacg ctctatagac aaaatgtgtt 3000 
tggtctcgtc acgaatggtg agagagaatt ggcctccgcc gcagagatcg cttgattatt 3060 
ggcctccagc gggcaatgtc ggggaaaacc aaactattga tgagaggtat cgacgaaaaa 3120 
tcaacaatga ccaacttttt gttacagttt tgttataaat atgagttttg gatattccat 3180 
tgcgtatttt tcttttctac tttcaaaaaa tctgctccaa cctttaatgg cttttcctgt 3240 
cttgtcaaaa tctggatttt tgaatatata atttttaaaa ccatcaaatt cagcgaaatg 3300 
aaatcatgca acacaacctt ttattttttc cgactgttgt gtattccatc aaactattca 3360 
aaaaatcaat ataatgattt ttttttcatt tttcggactc tagaggatcc ccggggattg 3420 
gccaaaggac ccaaaggtat gtttcgaatg atactaacat aacatagaac attttcagga 3480 
ggacccttgc ttggctagca aaaatgcata aggttttgct ggcactgttc tttatctttc 3540 
tggcaccagc atccgcactg gcagtctccg aaccggcctg caggatcgat tttttgcaaa 3600 
ttacgagcgt tgtagggggc ggacgatagg tcctataggt cttgtatatc atcattcatt 3660 
cattcattgg tacattcatt tacccacctt cctctttctg agcttctctg gagttctgtg 3720 
cttccttttt cccttatctt tatactgtaa tttttaactt tcaggcattg attggatccc 3780 
cgggattggc caaaggaccc aaaggtatgt ttcgaatgat actaacataa catagaacat 3840 
tttcaggagg acccttgctt ggagggtacc ggtagaaaaa atgagtaaag gagaagaact 3900 
tttcactgga gttgtcccaa ttcttgttga attagatggt gatgttaatg ggcacaaatt 3960 
ttctgtcagt ggagagggtg aaggtgatgc aacatacgga aaacttaccc ttaaatttat 4020 
ttgcactact ggaaaactac ctgttccatg ggtaagttta aacatatata tactaactaa 4080 
ccccgattat ttaaattttc agccaacact tgtcactact ttctgttatg gtgttcaatg 4140 
cttctcgaga tacccagatc atatgaaacg gcatgacttt ttcaagagtg ccatgcccga 4200 
aggttatgta caggaaagaa ctatattttt caaagatgac gggaactaca agacacgtaa 4260 
gtctaaacag ttcggtacta actaaccata catatttaaa ttttcaggtg ctgaagtcaa 4320 
gtttgaaggt gatacccttg ttaatagaat cgagttaaaa ggtattgatt ttaaagaaga 4380 
cggaaacatt cttggacaca aattggaata caactataac tcacacaatg tatacatcat 4440 
ggcagacaaa caaaagaatg gaatcaaagt tgtaagttta aacatgattt tactaactaa 4500 
ctaatctgat ttaaattttc agaacttcaa aattagacac aacattgaag atggaagcgt 4560 
tcaactagca gaccattatc aacaaaatac tccaattggc gatggccctg tccttttacc 4620 
agacaaccat tacctgtcca cacaatctgc cctttcgaaa gatcccaacg aaaagagaga 4680 
ccacatggtc cttcttgagt ttgtaacagc tgctgggatt acacatggca tggatgaact 4740 
atacaaatag cattcgtaga attccaactg agcgccggtc gctaccatta ccaacttgtc 4800 
tggtgtcaaa aataataggg gccgctgtca tcagagtaag tttaaactga gttctactaa 4860 
ctaacgagta atatttaaat tttcagcatc tcgcgcccgt gcctctgact tctaagtcca 4920 
attacccttc aacatcccta catgctcttt ctccctgtgc tcccaccccc tatttttgtt 4980 
attatcaaaa aaacttcttc ttaatttctt tgttttttag cttcttttaa gtcacctcta 5040 
acaatgaaat tgtgtagatt caaaaataga attaattcgt aataaaaagt cgaaaaaaat 5100 
tgtgctccct ccccccatta ataataattc tatcccaaaa tctacacaat gttctgtgta 5160 
cacttcttat gtttttttta cttctgataa attttttttg aaacatcata gaaaaaaccg 5220 
cacacaaaat accttatcat atgttacgtt tcagtttatg accgcaattt ttatttcttc 5280 
gcacgtctgg gcctctcatg acgtcaaatc atgctcatcg tgaaaaagtt ttggagtatt 5340 
tttggaattt ttcaatcaag tgaaagttta tgaaattaat tttcctgctt ttgctttttg 5400 
ggggtttccc ctattgtttg tcaagagttt cgaggacggc gtttttcttg ctaaaatcac 5460 
aagtattgat gagcacgatg caagaaagat cggaagaagg tttgggtttg aggctcagtg 5520 
gaaggtgagt agaagttgat aatttgaaag tggagtagtg tctatggggt ttttgcctta 5580 
aatgacagaa tacattccca atataccaaa cataactgtt tcctactagt cggccgtacg 5640 
ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac atgcagctcc 5700 
cggagacggt cacagcttgt ctgcaagcgg atgccgggag cagacaagcc cgtcagggcg 5760 
cgtcagcggg tgttggcggg tgtcggggct ggcttaacta tgcggcatca gagcagattg 5820 
tactgagagt gcaccatatg cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc 5880 
gcatcaggcg gccttaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata 5940 
ataatggttt cccagacgcc aggtggcact tttcggggaa atgtgcgcgg aacccccatt 6000 

tgtttattct tccaaacaca ttcaaatatg tatccgctca tgagacaata accctgataa 6060 

atgcctcaat aacattgaaa aaggaagagc atgagtatcc aacacttccg tgtcgccctt 6120 

attccctttt ttgcggcatt tcgccttccc gctttcg 6157 


<210> 11 
<211> 5692 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
pGF2014 


<400> 11 

taaacatata tatactaact aaccctgatt atttaaattt tcagccaaca cttgtcacta 60 
ctttctgtta tggtgttcaa tgcttctcga gatacccaga tcatatgaaa cggcatgact 120 
ttttcaagag tgccatgccc gaaggttatg tacaggaaag aactatattt ttcaaagatg 180 
acgggaacta caagacacgt aagtttaaac agttcggtac taactaacca tacatattta 240 
aattttcagg tgctgaagtc aagtttgaag gtgataccct tgttaataga atcgagttaa 300 
aaggtattga ttttaaagaa gatggaaaca ttcttggaca caaattggaa tacaactata 360 
actcacacaa tgtatacatc atggcagaca aacaaaagaa tggaatcaaa gttgtaagtt 420 
taaacatgat tttactaact aactaatctg atttaaattt ccagaacttc aaaattagac 480 
acaacattga agatggaagc gttcaactag cagaccatta tcaacaaaat actccaattg 540 
gcgatggccc tgtcctttta ccagacaacc attacctgtc cacacaatct gccctttcga 600 
aagatcccaa cgaaaagaga gaccacatgg tccttcttga gtttgtaaca gctgctggga 660 
ttacacatgg catggatgaa ctatacaaat agcattcgta gaattccaac tgagcgccgg 720 
tcgctaccat taccaacttg tctggtgtca aaaataatag gggccgctgt catcagagta 780 
agtttaaact gagttctact aactaacgag taatatttaa attttcagca tctcgcgccc 840 
gtgcctctga cttctaagtc caattactct tcaacatccc tacatgctct ttctccctgt 900 
gctcccaccc cctatttttg ttattatcaa aaaaacttct tcttaatttc tttgtttttt 960 
agcttctttt aagtcacctc taacaatgaa attgtgtaga ttcaaaaata gaattaattc 1020 
gtaataaaaa gtcgaaaaaa attgtgctcc ctccccccat taataataat tctatcccaa 1080 
aatctacaca atgttctgtg tacacttctt atgttttttt tacttctgat aaattttttt 1140 
tgaaacatca tagaaaaaac cgcacacaaa ataccttatc atatgttacg tttcagttta 1200 
tgaccgcaat ttttatttct tcgcacgtct gggcctctca tgacgtcaaa tcatgctcat 1260 
cgtgaaaaag ttttggagta tttttggaat ttttcaatca agtgaaagtt tatgaaatta 1320 
attttcctgc ttttgctttt tgggggtttc ccctattgtt tgtcaagagt ttcgaggacg 1380 
gcgtttttct tgctaaaatc acaagtattg atgagcacga tgcaagaaag atcggaagaa 1440 
ggtttgggtt tgaggctcag tggaaggtga gtagaagttg ataatttgaa agtggagtag 1500 
tgtctatggg gtttttgcct taaatgacag aatacattcc caatatacca aacataactg 1560 
tttcctacta gtcggccgta cgggcccttt cgtctcgcgc gtttcggtga tgacggtgaa 1620 
aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc ggatgccggg 1680 
agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac 1740 
tatgcggcat cagagcagat tgtactgaga gtgcaccata tgcggtgtga aataccgcac 1800 
agatgcgtaa ggagaaaata ccgcatcagg cggccttaag ggcctcgtga tacgcctatt 1860 
tttataggtt aatgtcatga taataatggt ttcttagacc tcaggtggca cttttcgggg 1920 
aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct 1980 
catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat 2040 
tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc 2100 
tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg 2160 
ccacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg 2220 
ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga 2280 
cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta 2340 
ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc 2400 
tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc 2460 
gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg 2520 
ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc 2580 
aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca 2640 
acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct 2700 
tccggctggc tggtttattg ctgataaacc tggagccggt gagcgtgggt ctcgcggtat 2760 
cattgcagca ctggggccag atggtaagcc ctcccgtacc gtagttatct acacgacggg 2820 
gagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat 2880 
taagcattgg taactgtcag accaagttta ctcatatata ctttagattg atttaaaact 2940 
tcacttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat 3000 
cccttaacgt gagttttcgt tccaccgagc gtcagacccc gtagaaaaga tcaaaggatc 3060 
ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct 3120 
accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg 3180 
cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca 3240 
cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc 3300 
tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga 3360 
taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac 3420 
gacctacacc gaactgagat acctacagcg tgagcattga gaaagcgcca cgcttcccga 3480 
agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag 3540 
ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg 3600 
acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag 3660 
caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc 3720 
tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc 3780 
tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc 3840 
aatacgcaaa ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag 3900 
gtttcccgac tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt agctcactca 3960 
ttaggcaccc caggccttac actttatgct tccggctcgt atgttgtgtg gaattgtgag 4020 
cggataacaa tttcacacag gaaacagcta tgaccatgat tacgccaagc tgtaagttta 4080 
aacatgatct taccaactaa ctattctcat ttaaattttc agagcttaaa aatggctgaa 4140 
atcactcaca acgatggata cgctaacaac ttggaaatga aataagcttc atgcgtcttc 4200 
ttgattactg tagatgtttg gcaacttatt tttacataaa aacgttttca ttcattattt 4260 
ccatcattca tttatctttc tgtgttttta gttagtttta gctagttttt ttctaaattc 4320 
ctaactttaa aaaatctgga aaagaaaatt aaaaaatttt gtccctatta tttattttat 4380 
tactggaaaa tcttcaaaca ggaaaaccca ccgcgttttg cttattgctg tatttatgaa 4440 
aaaaaaaaca atcaatattg gtcaagtaaa taagaaaaaa ttaacgaatc tctatctgac 4500 
accagatgcg accctctatt ccacttctct gttcatctgc tgcttctttt gtttaaccag 4560 
ataaatctcc ctcggggaaa accgtcaaaa aaaggcaaac taaatgcaaa cacgctctat 4620 
agacaaaatg tgtttggtct cgtcacgaat ggtgagagag aattggcctc cgccgcagag 4680 
atcgcttgat tattggcctc cagtgggcaa tgtcggggaa aaccaaacta ttgatgagag 4740 
gtatcgacga aaaatcaaca atgaccaact ttttgttaca gttttgttat aaatatgagt 4800 
tttggatatt ccattgcgta tttttctttt ctactttcaa aaaatctgct ccaaccttta 4860 
atggcttttc ctgtcttgtc aaaatctgga tttttgaata tataattttt aaaaccatca 4920 
aattcagcga aatgaaatca tgtaatacaa ttttttattt tttccgactg ttgtgtattc 4980 
catcaaacta ttcaaaaaat caatataatg attttttttt catttttcgg actctagagg 5040 
atccccgggg attggccaaa ggacccaaag gtatgtttcg aatgatacta acataacata 5100 
gaacattttc aggaggaccc ttgcttggct agcaaaaatg cataaggttt tgctggcact 5160 
gttctttatc tttctggcac cagcatccgc actggcagtc tccgaaccgg cctgcaggat 5220 
cgattttttg caaattacga gcgttgtagg gggcggacga taggtcctat aggttttgta 5280 
tatcatcatt cattcattca ttggtacatt catttaccca ccttcctctt tctgagcttc 5340 
tctggagttc tgtgcttcct ttttccctta tctttatact gtaattttta actttcaggc 5400 
attgattgga tccccgggat tggccaaagg acccaaaggt atgtttcgaa tgatactaac 5460 
ataacataga acattttcag gaggaccctt gcttggaggg taccggtaga aaaaatgagt 5520 
aaaggagaag aacttttcac tggagttgtc ccaattcttg ttgaattaga tggtgatgtt 5580 
aatgggcaca aactctctgt cagtggagag ggtgaaggtg atgcaacata cggaaaactt 5640 
acccctaaat ttatttgcac tactggaaaa ctacctgttc catgggtaag tt 5692 